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(57) Abstract 

This invention relates to mycobacterial species-specific reporter mycobacteriophages (reporter mycobacteriophages), meth- 
ods of producing such reporter mycobacteriophages and the use of such reporter mycobacteriophages for the rapid diagnosis f 
mycobacterial infection and the assessment of drug susceptibilities of mycobacterial strains in clinical samples. In particular, this 
invention is directed to the production and use of luciferase reporter mycobacteriophages to diagnose tuberculosis. The mycobac- 
terial species-specific reporter mycobacteriophages comprise mycobacterial species-specific mycobacteriophages which contain 
reporter genes and transcriptional promoters therein. When the reporter mycobacteriophages are incubated with clinical samples 
which may contain the mycobacteria of interest, the gene product of the reporter genes will be expressed if the sample contains 
the mycobacteria of interest, thereby diagnosing mycobacteria] infection. 
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mCQPftCTBR IAIi SPECIES-SPECIFIC PEPQRTER 
tTCCQPACTERIOPBACES 

STATEMENT OF GOVERNMENT INTEREST 
This invention was made with government 
support under NIH Grant Number AI26170. 

FIELD OF THE INVENTION 
This invention relates to- mycobacterial 
5 species-specific reporter mycobacteriophages (reporter 
mycobacteriophages) , methods of making such 
reportermycobacteriophages, and the use of such 
reporter mycobacteriophages, for example, to rapidly 
diagnose mycobacterial infection and to assess drug 
10 * susceptibilities of mycobacterial strains in clinical 
samples. Specifically, this invention relates to the 
use of mycobacterial species-specific luciferase 
reporter mycobacteriophages to diagnose tuberculosis 
and to assess the drug susceptibilities of the various 
15 strains of Mycobacterium tubQECUlPS X's 

(Mi tuberculosis) - 

To produce the mycobacterial species-specific 
reporter mycobacteriophages of the invention, 
transcriptional promoters and reporter genes are 
20 introduced into the genomes of mycobacterial 
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species-specific mycobacteriophages . These reporter 
genes may be the genes for luciferase or the 
B-galactosidase gene, and provide the DNA which 
encodes production of a gene product. The reporter 
5 mycobacteriophages may be used by incubating same with 
samples which may contain the specific mycobacteria of 
interest. If the mycobacteria of interest is present, 
then the reporter mycobacteriophages introduce the 
recombinant nucleic acids which encode expression of 
10 the gene product into the mycobacteria of interest, 
and the mycobacteria then express the gene product. 
The expressed reporter gene product may be detected by 
a suitable assay, for example, through the detection 
of photons or the conversion of an easily assayable 
15 chemical reaction. The presence of such gene product 
indicates that the sample contains the mycobacteria of 
interest, and hence the mycobacterial' species-specific 
reporter mycobacteriophages may be used to detect and 
thereby diagnose the specific mycobacterial 
20 infection. In addition, since, signals may not be 
generated by cells which are not metabolically active 
in the presence of antibiotics, the mycobacteria 
species-specific reporter mycobacteriophages of this 
invention may be used to assess the drug 
25 susceptibilities : of various strains of mycobacteria. 
. If antibiotic drugs are added to the sample containing 
' the; reporter mycob and the gene product 
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is detected, the mycobacteria is metabolically active 
and hence resistant to the antibiotic drug. 
BACKGROUND OF THE INVENTION 
In 1990, there was a 10% increase in the 
5 incidence of tuberculosis in the United States. In 
addition, there has been an increase in the appearance 
of clinical isolates of tuberculosis that are 
resistant to antibiotics used to treat the disease. 
This problem is exacerbated by the length of time that 

10 is currently needed both to diagnose tuberculosis, 

and to . determine the drug susceptibilities of various 
strains of M. tuberculosis . As a result, patients 
with M. tuberculosis may remain infectious for long 
periods of time without being treated, or may be 

15 treated with a drug to which the bacterial strain is 
resistant. Therefore, a need has arisen in the field 
for a method of diagnosis of M. tuberculosis (and 
other mycobacterial infections) which is rapid, 
sensitive and specific, which method is also capable 

20 of assessing the drug susceptibilities of the various 
strains of M. tuberculosis and other mycobacterial 
strains. It is critical that a mycobacterial strain 
be assessed for drug resistance rapidly because a 
patient infected with a strain of M. tuberculosis or 

25 another mycobacteria must be treated immediately with 
the particular antibiotic drug(s) to which the strain 
is not resistant ^, and not with antibiotic drug(s) to 
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which the strain is resistant, or the patient may die. 

Currently, the most rapid test available for 
the diagnosis of M, f-nharculosis is the staining of 
sputum samples for acid-fast bacilli, which is a 
5 tedious procedure, and which procedure has high 
sensitivity. Alternative methods for diagnosis 
require cultivation of the bacilli for approximately 
two to six weeks followed by classification of the 
cultured organism. Typical diagnostic tools include 
10 biochemical tests, analysis of mycolic acids and 
serotyping. All of these, tests are time-consuming. 
More recently, the use of oligonucleotide probes and 
Polymerase Chain Reaction have been suggested for the 
•identification »f pi- friiharculosis species. Although 
15 these methods may be useful approaches, their uses in 
a clinical setting have not yet been determined. 
Further, these methods do not distinguish between- live 
and dead organisms, and . are therefore of limited use 
in the determination of drug sensitivities of clinical 

20 isolates. 

In addition, MyroharteHum avium ( M- avium ) is 

a mycobacteria which is often found in 

immunpsuppressed patients. This mycobacteria is 
typically disseminated throughout the bodies of 
25 iramunosuppressed patients, such as AIDS patients, and 
/causes m. avium infection. Because this mycobacteria 
often causes death in. immuno suppressed patients, it is 



necessary to be able to diagnose and assess the drug 
susceptibilities of the various strains of M. avium . 

It is therefore an object of this invention to 
construct broad mycobacterial host range and 
mycobacterial species-specific reporter 

mycobacteriophages . 

It is another object of this invention to 
provide mycobacterial species-specific reporter 
mycobacteriophages which may be used to rapidly 
diagnose mycobacterial infections. 

It is still another object of this invention 
to provide mycobacterial species-specific reporter 
mycobacteriophages which may be used to rapidly assess 
the drug susceptibilities of different strains of 
mycobacteria in clinical samples. 

It is yet another object of this invention to 
provide mycobacterial species-specific reporter 
mycobacteriophages wherein the reporter genes are 
lucif erase genes, which mycobacterial species-specific 
reporter mycobacteriophages may be used to rapidly 
diagnose mycobacterial infections and to rapidly 
assess the drug susceptibilities of various strains of 
mycobacteria. 

It is a further object of this invention to 
provide mycobacterial species-specific luciferase gene 
reporter mycobacteriophages which may be used to 
rapidly diagnos tuberculosis and assess the drug 
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susceptibilities of the various strains of 
M. tuberculosis. 

SUMMARY Of THE INV ENTION 

This invention relates to broad host range and 
5 mycobacterial species-specific reporter 

. mycobacteriophages, (reporter, mycobacteriophages) , 
methods of producing such reporter mycobacteriophages, 
and the use of such reporter mycobacteriophages to 
rapidly diagnose mycobacterial infection, such as 
10 m HihPrculosis , and to distinguish which strains of 

the mycobacteria are drug-resistant. To produce these 
reporter mycobacteriophages, reporter genes and 
transcriptional promoters are introduced into the 
genomes of mycobacterial species-specific 
15 mycobacteriophages. The promoter and reporter 

gene-containing : mycobacteriophages (reporter 
mycobacteriophages) are then incubated with a 
clinical sample which may contain the mycobacteria of 
. interest, such m, ^.hArmilosis . The reporter 
20 mycobacteriophages are specific for the mycobacteria 

which is . sought to be detected. The reporter 
mycobacteriophages efficiently introduce the 
recombinant . nucleic acids which encode the expression 
of the reporter gene's gene product into the 
25 mycobacteria of interest, and the mycobacteria then 
express the gene product. A substrate or other means 
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capable of allowing for the detection of the gene 
product is then added to the sample. If the gene 
product or the signal generated by the gene product is 
detected, the presence of the infectious mycobacteria 
5 is known, thereby diagnosing the disease. To assess 

drug susceptibility of mycobacteria, drugs such as 
antibiotics ma-, be added to a sample containing the 
reporter mycobacteriophages of this invention. If the 
mycobacteria are susceptible to a drug after exposure 

10 to the drug, the mycobacteria will be killed. 

However, drug-resistant mycobacteria will continue to 
be metabolically active in the presence of the drug, 
and will continue to express the detectable gene 
product of the reporter genes. 

15 The preferred reporter genes of the present 

invention are the Firefly lucif erase lux gene (FF lux ) , 
the luciferase lux genes of Vibrio fischeri . the 
luciferase lux genes of Xenorhabdus lum inescens and 
the E. coli B-galactosidase gene ( lac Z) . The 

20 preferred promoters of the present invention are hsp60 
and L5 gene 62 promoter, and the preferred 
mycobacteriophages are L5, TM4 and D56A. These 
reporter mycobacteriophages are preferably used for 
the rapid diagnosis of tuberculosis and M. avium 

25 infection, and the accurate assessment of drug 
susc ptibilities of the various strains of 
M. tuberculosis and j». flvimn. 
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BRTFF DESCR IPTION OF THE DRAWINGS 
The above brief description, as well as 
further objects and features of the present invention, 
will be more fully understood by reference to the 
5 following detailed description of the presently 

preferred, albeit illustrative, embodiment of the 
present invention when taken in conjunction with the 
accompanying drawings wherein: 

FIGURE 1 represents the genome organization of 
10 mycobacteriophage L5; 

FIGURE 2 represents a luciferase shuttle 
plasmid pYUB180 wherein reporter gene FFJLjis. is fused 
to the BCG hsp60 promoter; 

FIGURE 3 represents the amount of luciferase 
15 activity of m. smeomatis which contains the pYUB180 
shuttle plasmid and the FFJjis gene; 

FIGURE 4 represents the effect of various 
antibiotic drugs on the metabolic activity of control 
mycobacteria and drug resistant mycobacteria in the 
20 presence of reporter mycobacteriophages which 

contain luciferase reporter genes; 

FIGURE 5 represents shuttle plasmid phAE39 
wherein the reported gene is FFlax, the promoter is 
hsp60, the phage is TM4 and the cosmid is pYUB216. 
25 FIGURE 6 represents luciferase activity of 

M . smeamatis cells infected with shuttle phasmids 
phAE39; and. ■ ^ . .. , : , t . . , .. 
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FIGURE 7 represents a flow chart for cloning 
different promoters into TM4 : : lux shuttle phasmid 
phAE39. 

DETAILED DESCRIPTION OF THE INVENTION 

This invention is directed to mycobacterial 
5 species-specific reporter mycobacteriophages, 

(reporter mycobacteriophages), methods of producing 
such reporter mycobacteriophages and the use of such 
reporter mycobacteriophages for the rapid diagnosis of 
mycobacterial infections and the accurate assessment 
10 of mycobacterial drug susceptibilities* 

In order to produce such reporter 
mycobacteriophages, mycobacterial species-specific 
raycobacteriophage genomes are modified by introducing 
therein transcriptional promoters and reporter genes 
15 whose gene product can be sensitively detected. The 
reporter mycobacteriophages may then be incubated with 
clinical samples suspected of containing the 
mycobacteria of interest, either directly of after 
culture, and the samples tested for the presence of 
20 the reporter gene product, thereby diagnosing 
mycobacterial infection. 

The method of this invention allows for rapid 
diagnosis because only the amount of time necessary 
for the reporter mycobacteriophages to infect their 
25 host cells and the amount of time n c ssary for the 
host cells to synthesize the reporter gene product are 
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requir d to allow for diagnosis. Typically, the 
amount of time required for the reporter 
mycobacteriophages to infect their host cells and for 
the host cells to synthesize the reporter gene product 
5 is between ten minutes and sixteen hours. 

The assessment of drug susceptibilities with 
the reporter mycobacteriophages of this invention is 
accurate because the reporter mycobacteriophages only 
allow for the detection of metabolically active 
10 mycobacterial organisms, the presence of which 
metabolic activity indicates that a drug has not 
killed the mycobacteria and that the mycobacteria is 
resistant to the drug. 

To enhance diagnosis specificity, a series of 
15 similar reporter mycobacteriophages, each of which 
having well-defined but different specificities for 
mycobacterial species, is selected. 

Mycobacteriophage L5, a temperate virus with a 
broad host-range among mycobacteria, is the most 
20 thoroughly characterized of the mycobacteriophages. 
L5 particles are morphologically similar to the family 
of phages that includes phage g and contain a linear 
dsDNA genome with cohesive ends. The inventors have 
determined the DNA sequence of the entire gene as well 
25. as several gene functions. The DNA sequence of the L5 
mycobacteriophage is as follows: 
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***SEQUENCE*** 

1 GGTCGGTTAT GCGGCCGAGC CATCCTGTAC GGGTTTCCAA GTCGATCAGA GGTAGGGGCC 

61 GGCACAGAAA CCACTCACAT CAGGGCTGTG CGCCTCCAGG GCGCGTGAAC TCCCACACCC 

121 CGGTGTAGTT ACATCCCGGA ATTGTCTCAG CGCCTCTCAG GGCGCTTCTC ATAAACAGTG 

181 ATCTACGCCA CTCCTGACGG GTGGCTGTCA AGGATACTCA CCTTCCCTAC TAATGAGGGG 

241 CTAAGAGCCC CTCTCTATAG AGCGCCGCAC AGGCGGCGCG ATAAGAGCGC CACCAGGCGC 

301 TCATCTAAAG ACCGGCCTTG AAGGGCCGGT CATAGAGATC TATTCGATCC GGCAACCGCC 

361 GGATCTCAAG GCCGCGCCAG TGCGCGGCCC TATAGAGGGG TGACTCAACT GTGCATGGCA 

421 CTCGCTCGAG TGCCCACTGG AGCACTCAAC CGGGGAAGTT CGACGTTCTC AACCTGCGAA 

481 TGACGTTTGA ATCGTCATCC GCGTACGAAA TCCCCGATCT GCGGCCGACC GACTTCGTGC 

541 CGGCCTATCT CGCGGCCTGG AATATGCCGC GTCACCGCGA TTACGCCGCC AAGAACGGCG 

601 GCGCGCTGCA CTTCTTCCTT GACGATTACC GGTTTGAGAC CGCGTGGTCG TCCCCCGAGC 

661 GCCTTCTCGA CCGCGTAAAG CAGGTCGGCG CTGCACTCAC GCCGGATTTC AGCCTCTGGA 

7 21 CGAACATGCC GAAGGCGGCG CAGCTATGGA ACGTCTACCG CTCCCGCTGG TGTGGCGCGT 

781 ATTGGCAGTC GGAAGGAATC GAGGTGATTC CGACGGCGTG TTGGGCGACT CCCGACACGT 

841 TCGATTTCTG TTTCGACGGG ATCCCGATGG GATCGACCGT CGCAATTTCT TCGATGGGCA 

901 TTCGCTCTTC AAAAGTCGAC CAGGAGCTTT TCCGGTACGG ACTACGCGAA CTCATCGATC 

961 GCACTCAACC GCAACTGCTT TTGGCATATG GCCAGCTTCG GCATTGCGAC GACATGGATT 

1021 TACCAGAGGT CCGCGAATAC CCGACCTACT GGGACAGACG ACGAAAGTGG GTAACTGCCG 
ATGGGAGGCC GGGGAAGTAA AGGCGGCCCC GGTCCCGGAA CCGGAGCACG CAACCGCAGA 

11*1 GGCGCTGGAG CCCCCGGATC GGGCGGCGTA GGCGGCGTCG GAGGCGGGGG TGGAGCTGCA 

1201 GGGAGCAGCG GAGGCGGCAA GGGAACGGCA GCGCCGGTAC CGGAGGCGTC ACCGGTGGCG 

1261 GCGGAAGTGG AGCCGGCGGC GGTGGCAGCA GCCCCAACAC CCCGGTGCCC CCCACCGAGC 

1321 TGGAGAAGAA GCGCGGCGAA TACAACCAGA TCGCCATCGA CGCCCAGAAA CAGCACGCGC 

"81 CCACCGATGA GAAGCGCGAG GCCAAGCGCA AGCAACTGAT GGATCGAGTC GGAGGAGACT 

i J GGCAGGCTTT GGACCCGGAT CACCACGACG CCATCAAGGT GGCGATGGAT GACGCCATGC 

1501 GGAAGATCCT CTCCGAGGAG GAGATCGTCC ACCGCACCAA GCACTTCGGC GACCTACTCG 

1561 ACTCCGGTCG ACTCAAGTCG CTGTTCGAGG TCGGCTTCTC AGCCGGTGGC GACACCCCGA 

1621 CCGAACGCGC CCTCCTCGAG GACGCCTGGT TCGGCGCAGG CAAGGTTCCC CCGATCTACT 

1681 CGGCAATCGA GTTCAACGGC GCTCCGACAG CCGGCCTCGG CATGTACGGC GGCACCAAGC 

1741 TCTACATGAA GG ACTCGGTC AAGGACCGCG T'CACCGTGAC CATCGGCGAC TCGCTGATGT 

1801 CGAGCTGGGA CGTATTCCCC GGCCGTCCTG GCGACGGCGT GGGGCTGTGG GCCAGCCTGT 

1861 CGAAGATCGA GGGGCTGGTC GATCCGAGCA AGACCCGCGA AGAGAACATG CAGGCGGTGT 

1921 ACGACTCGTT CAAGAAGTAC GGCACCCTGG ACGGCTTCAT CGAGGCGCAG ATCCACGGCG 

1981 GCGTCCTGGT CGAGGACATC AAGAAGGTCG TGTTCACGCA GCGGCCGAGC CCGATCTTCA 

2041 CCGATAAACT GGACGAACTT GGAATCCCGT GGGAGGTGCA GTAATGGCGC AGATGCAGGC 

2101 GACACACACA ATCGAGGGGT TCCTGGCTGT CGAGGTGGCC CCTCGGGCGT TCGTCGCAGA 

2161 GAACGGCCAC GTACTGACCC GGCTGTCGGC CACGAAGTGG GGCGGTGGCG AGGGTCTCGA 

2221 GATCCTCAAC TACGAGGGTC CAGGGACCGT CGAGGTCTCC GACGAGAAGC TCGCCGAAGC 

2281 CCAGCGGGCC AGCGAGGTCG AGGCTGAACT TCGCCGCGAG GTCGGCAAGG AGTGAGCTGG 

2341 'GCCGGCTCAG GCCGGCGACA GGAACTACCA GAGGACTGGG AGCTGAATTA CCGGCTCCCG 

2401 GTCCTTTCTG CTGCCAACTG GCTTTGCCAG ATCAACGGTC CCGGATGCGT AAGGGCCGCA 

2461 ACCGATGTCG ACCACATCAA. GCGCGGGAAC GACCACAGCC GGTCCAATCT GCAGGCAGCC 

2521 - TGCCATGTCT GTCACGGCAA GAAATCAGCC GCCGAGGGCG TAGCCCGACG GCGGGAACTT 

2581 AGAGCCCGGA GGAAGCGACC ACCCGAACGC CATCCTGGGC GTCGATAAGC GGGCCAGGTG 

2641 CCCGCTCCAC CCAGGAGGTG AACAGTGGGC ACGCGAGGCC CAATCGGAAA ACGAGATGAA 

2701 GAGCGGGTTC GTCGGAACAC CCCGGACAGT CCAACCGACA CGATCCAGAT GCCCGGTCTG 

2761 GTGACGATCC CCGAGATGGG CGATCTAAGC CACGACCGCC GCACGCACCA GCTCGTCAAG 

2821 . GACATGTACG AGTCGATCAA GCAGTCGGCA GCCGTGAAGT ACTACGAGCC GACCGACTGG 

2881 CAGATGGCCC GACTCGCCCT CTACACACTT AACCAGGAAC TCATCGCAGC CGAGAACAAC 

2941 GGCAAGCCCG TGGGCGCGAT GAAGCTCACT GCCATCAACC AGATGCTCTC CGCGCTGCTG 
r.CTGACCGAAG . .GTGACCGACG CCGCGTCCGA CTCGAAGTCG AACGAGCACC CGCTGACCCG 
;;;}: ACAGGCGGGA AGGTCGTTGA CGTGACCGAC GTGCTCAAGC AGCGCCTCGC CAAGGCGAGC 

a i vGGCGGG AGCT GATGGTCCCC CGAGGGGTTT CTAGAGCCGC TGCCGCTACC AGCCGCTCCC 

-3181 CCTCGGGGTA GACATCGAAA GGAACCACAT GGCCGACCTC GGCAACCCAC TCGACCTCGA 
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3241 GATGCTCTGC CTGGTCACAG GCCGGGACTT CCGCTGGACC ATCGATTACC CGTGGGGTCC 
3301 GGGAGAGCTG TTCCTCGAAC TCGAGACCGG CGGCGAACAC aacgcgctgc atcaggtcta 
3361 TGTCACCGGG GCGACCGGAG GCACGTACAC GCTGAACGTC aacggcacca acaccccggc 
3421 OICMCTAC AACGACGTGT CGGAGAATCC GCAGGGGCTG GCAGGCGACA TCCAAGACGC 
34 81 TCTGGACGCA GCCGTCGGAG CCGGAAACGC TGTCGTGCAT ccggtctcgc tgttccctgc 
3541 gtggacactg aacttcaacc tcaacgccag caagccgctc Jccgagcagt tggtcaacac 
3601 gatcaacaag gccgcgaacg acttcttcga cacgttcgac caactacttg gggtcgacgt 
3661 ggagatgacg gtcaccgaca ccctgaactt caagctcaag gtgacctcgc ggcgctcgtt 
3721 cgatgaggtc ggtgtcgtca cgttcgcggt cgacgtgacc agccaggcag tcatcaactt 

3781 CTTCAACTCC GTCGCCGAAC TCACCGGAGC GGTGAACAGC GTCAACGTCG ACTTCTACTG 
3841 GAACCGGACG TATGACATCG AGTTCACCGG WCCCTTGGG CTGCAGCCGA "CCGGCTAC 
7Q<11 T&CS.BCCGAC ATCACCAACC TGGCGGGTAC CAGCAAGGCC GTCTCAGTCA CGGTGGTCGA 
3961 GCCAGGAAAG AAGAGGCTGA cStCTGGCC GTTCACGGTC AACGGTGAAA CCGCAACCAT 

Jo" Smqtcom SS! cSacaagat ccccaaccgc tgccgctggc agttggttca 

4661 CATGCCGACC GGCGAGGCAG CCGGCGGCGA TGCAAAGCAG CTCGGCCGCG TTTACCGACA 

4141GXCGAGGTM caccgcaccc atcagagatg gtgggccaga cggccttcgg gccgtcccct 
4201 gacgtgtagc tcaatggcag agcgcccgac tgttaatcgg gtggttgaag gttcgagtcc 

4261 TTCCATGTCA GCGAGGGCTG AACCGGACCC GTGTCCGGTG TAGGCACTTT CCGCAGGCGG 

4321 TTCCCCAGAG cgtggggagc ccctgccctg wcacgtagc tcaaxtggta gagcagcggt 

4381 CTCCAAAGCC GCCGGTTCCA GGTTCGACTC CTGGCGTGTA TGCACACACC CCTGACTCCT 
4441 flTTAflCGGAG TGTTCGCCTT TCGGGCCTGG GGTCTTTTTC CCCGTTCGTC TAATCGGTAA 

i&Sggggg tctSSmom attSroMO ttcgagtcct JGGCGGGGAG ccaacttgac 

4561 ATCCACCCGA AAGGAACAAC ATGACCTTCA CAGTCACCCG CGAGAGAGCG CAGTGGGTCC 

4621 Sgacatggc CCGCGCTCGC gacggtctcc cctacgcgta cggcggggcg ttcaccaaca 

4481 ACCCGAGGGT GTCGACTGAC TGCTCTGGCC TGGTGCTGCA GACCGGGGCT TGGTATGGAG 
4741 GTCGCACCGA CTGGGTCGGA AACCGTTACG GCTCAACCGA ATCGTTCCGG CTCGACCACA 
4801 AGATCGTCTA CGACCTAGGG TTCAAGCGGA TGCCCCGAGG CGGGCCAGCG GCCTTGCCGA 

486iSE8g1t gSgSSg?c GGGCTCCAGC acggaggcgg cggggtctac. tcgcacaccg 

492lS^CACGTT GATGACGATG GACCACCCCG GTGGCCCGGT CAAGATGTCC GACCGAGGCG 
4981 TCGACTGGGA GTCCCACGGC AACCGCAACG GCGTAGGCGT CGAACTTTAC GAGGGCGCAC 
. 5041 GGGCATGGAA CGACCCTCTG TTCCATGACT TTTGGTACCT GGACGCAGTC CTCGAAGACG 
5101 AAGGAGACGA TGACGAATTG GCTGACCCAG TTCTAGGGAA GATGATCCGC GAGATCCACG 
5221 CGTGCCTGTT CAATCAGACC GCGTCGACCA GCGATCTGGC GACCCCTGGT GAAGGCGCTA 
. 5281 TCTGGCAGCT ACACCAGAAG ATCCACTCGA TTGACGGCAT GCTCCACCCG MCCACOCTG 
5341 AGCGGCGCGC TCGCGCAGGC GATCTCGGTG AGCTGCACCG AATCGTGTTG GCCGCGAAGG 
5401 GCTTGGGCGT GAAGCGCGAC GAGGTGACCA AGCGGGTCTA CCAGAGCATC CTCGCCGACA 
5461 TCGAGCGGGA CAACCCCGAA GTACTTCAGC GATACATCGC AGAAAGAGGT GGCCTATGAG 
5521 CCCCAAGATC CGACAGACCA TCTACCTGCT CGGCACCGCC GCCCCGGCAC TGCTGGGCAT 
5581 CGTCCTGATC TGGGGCGGGC TCGACGCTGA GTCGGCGGCT. GACCTCGGTG ACATCA.TGC 
5641 ^CGTCGTG TCGATACTAG TCTCCGGTGC GCCGGCCGTA GCGGCAGGCA CCGTACGCAG 
5701 Sagcgcaag GACGGCACGT TGTCCACCAG CCCGGTGGAT CAGGTCACCA agggcgtcga 
5761 GCAGGTGCTC GCGGCCAGGC AGAGTGCCGA GGCTGAAGTC fCGAAGGTCA JGCAGGCGCT 
5821 QGAGACCGCC GTCAGCGGTT CTCTCCCGCA GCTCGGCCCG. CTGGCCACGC AGATCCTCAA 
5881 SoGCTGAC GACACCGTCT GGCGTCCATG AGCAAGCCCT GGCTGTTCAC CGTCCACGGC 

5941 ACAGGCCAGC CCGACCCGCT CGGGCCTGGT CTGCCTGCCG ATACCGCACG GGACGTACTT 
6001 GACATCTACC GGTGGCAGCC CATCGGCAAC TACCCGGCAG CGGCGTTCCC GATGTGGCCG 
6061 TCGGTCGAAA AGGGTGTCGC TGAGCTGATC CIGCAGATCG AGCTGAAGCT GGACGCAGAT 
6121 CCGTACGCGG ACTTCGCGCT GGCCGGCTAC TCGCAGGGAG CCATCGTGGT GGGCCAGGTG 
6181 CTCAAGCACC ACATCATCAA CCCGAGAGGT CGACTGCACC GGTTCCTGCA CCGGCTCAGG 
6241 AAGGTCATCT TCTGGGGTAA TCCGATGCGG CAGAAGGGCT TTGCCCACAC CGACGAGTGG 
6301 ATTCACCAGG TCGCTGCCTC GGACACGATG GGCATCCTCG AGGACCGACT GGAGAACCTC 
6361 GAGCAGTACG GCTTTGAGGT CCGCGACTAC GCGCACGACG GCGACATGTA CGCCTCCATC 
6421 AAGGAGGACG ACATGCACGA GTACGAGGTG GCCATTGGCC GAATCGTGAT GAGCGCTAGG. 
6481 CGATTCATCG GAGGTAAGGA CTCCGTCATC GCCCAGCTCA TCGAGCTTGG ACAGCGTCCG 
6541 ATCTGGGAGG GAATCGCGAT GGCCAGAGCC ATCATCGACG CCCTCACGTT CTTCGCCAAG 
6601 TCGACCCAAG GCCCGAGCTG GCCGCATTTG TACAACCGCT TCCCGGCGGT CGAGTTCCTA 
6661 CGACGAATCT GAGAAAGGAG GCGGGGTGAG CCTCAACAAC CACCACCCGG AGCTTGCCCC 
6721 GTCTCCCCCT CACATCATCG GCCCGTCCTG GCAGAAGACG GTCGATGGTG AGTGGTATCT 
6781 GCCTGAGAAG ACCCTCGGCT GGGGAGTCCT GAAGTGGCTC TCCGAGTACG TGAATACCCC 
TGGCGGGCAT GACGATCCGA ACCGTCTGGC GACGTTGATC GCGCTCTCCG AGGCAGGTCT - 
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TCTCGACAAC GAGAACATGT TCATCCCCAC CGACGAGCAG GTACGCCTGG TCCTCTGGTG 
GTACGCAGTA GATGACCAGG GCCAGTACAT CTACCGCGAG GGCGTGATCC GCCGGCTCAA 
GGGCTGGGGC AAGGATCCGT TCACCGCCGC GCTCTGCTTG GCGGAACTCT G7GGCCCCGT 
AGCCTTTTCA CACTTCGACG CCGACGGTAA CCCGGTCGGC AAGCCGCGTT CAGCCGCGTG 
GATCACCGTC GCGGCCGTCA GCCAGGACCA GACGAAGAAC ACGTTCTCGC TGTTCCCGGT 
GATGATCAGC AAGAAGCTGA AGGCCGAGTA CGGCCTGGAC GTGAACCGCT TCATCATCTA 
CTCCGCAGCC GGTGGCCGTA TTGAGGCAGC GACCTCGAGC CCCGCGTCGA TGGAGGGTAA 
CCGCCCGACG TTCGTCGTCC AGAACGAGAC GCAGTGGTGG GGCCAAGGCC CCGACGGCAA 
GGTCAATGAA GGCCACGCGA TGGCAGAGGT CATCGAAGGC AACATGACCA AGGTCGAGGG 
CTCCCGCACC CTGTCGATCT GCAACGCCCA CATCCCCGGC ACCGAGACGG TCGCCGAGAA 
GGCATGGGAC GAGTACCAGA AGGTCCAGGC AGGCGACTCT GTCGACACCG GGATGATGTA 
CGACGCGCTG GAAGCGCCGG CCGACACCCC GGTCTCCGAG ATCCCCCCGC AGAAGGAGGA 
TCCCGAGGGA TTCGAGAAGG GCATCGAGAA GCTCCGCGAG GGCCTGCTCA TCGCCCGAGG 
CGACTCCACC TGGCTGCCGA TAGACGACAT CATCAAGTCG ATTCTGTCGA CCAAGAACCC 
GATCACCGAG TCGCGGCGCA AGTTCCTGAA TCAGGTAAAC GCCGCTGAGG ACTCGTGGCT 
CTCACCGCAG GAATGGAACC GGTGCCAGGT CGACCTGGCC AAGTACCTGG ATAAGCACGG 
CAGGGAGTTC GCTCCGCTGC AGCGCGGTGA CCGGATCACC CTCGGGTTCG ACGGGTCGAA 
GTCCAACGAC TGGACCGCGC TCGTCGGCTG CCGTGTCAGC GACGGCCTGC TGTTCGTCAT 
CGACATCTGG GATCCCCAGA AGTACGGCGG GGAGGTTCCC CGCGAAGACG TTGACGCCAA 
GGTCCATTCG GCGTTCGCCC ACTACGACGT GGTGGCGTTC CGCGCCGACG TGAAGGAGTT 
CGAGGCGTAC GTCGACCAGT GGGGCCGGAC CTACAAGAAG AAGCTCAAGG TCAACGCCAG 
CCCGAACAAC CCGGTGGCGT TCGACATGCG CGGACAGCAG AAGAGGTTCG CGTTCGACTG 
CGAGCGACTC GAGGACGCGG TCCTTGAGGG CGAGGTCXGG CACGACGGCA ATCCCGTTCT 
GCGCCAACAC GTTCTGAACG CCAAACGACA CCCAACGAAC TACGACGCCA, TCGCGATTCG 
CAAGGTCACG AAGGACTCCA GCAAGAAAAT CGACGCTGCA GTCTGCGCTG TCCTCGCGTT 
CGGGGCGAGA CAGGACTACC TCATGAGCAA GAAGGCCCGT AGCGGCCGGG TGGTGATGGT 
TCGATGACAG CACCGCTCCC CGGTATGGAG GAGATCGAAG ACCCCGCAGT CGTACGAGAA 
GAGATGATCT CGGCCTTCGA GGATGCTTCC AAGGATCTCG CCAGCAACAC CAGCTACTAC 
GACGCTGAGC GCCGGCCAG A GGCCATCGGC GTCACCGTCC CGAGAGAGAT GCAGCAACTG 
CTGGCTCACG TCGGATACCC CAGGCTCTAC GTCGACTCAG TCGCCGAGCG CCAGGCCGTC 
GAGGGTTTCC GCCTCGGCGA TGCCGACGAG GCTGACGAAG AGCTGTGGCA GTGGTGGCAG 
GCCAACAACC TCGACATCGA GGCACCACTG GGCTACACCG ACGCTTACGT TCACGGCCGG 
TCGTTCATCA CGATCAGCAA GCCAGACCCG CAGCTCGACC TGGGTTGGGA TCAGAACGTC 
CCGATCATCC GCGTCGAGCC GCCCACCCGA" ATGCACGCCG AGATCGACCC CCGGATCAAC 
CGGGTGTCCA AGGCCATCCG AGTCGCATAT GACAAGGAGG GCAACGAGAT TCAGGCTGCC 
ACGCTGTACA CGCCGATGGA GACCATCGGC TGGTTCCGCG CTGACGGTGA GTGGGCTGAG 
■fGGTTtAACG TCCCGCACGG TCTGGGCGTC GTTCCCGTTG TGCCGCTTCC GAACCGGACC 
CGGCTCTCGG ACCTGTACGG CACCAGTGAG ATCACGCCCG AGCTTCGGTC GATGACCGAC 
GCGGCGGCGC GCATCCTCAT GTTGATGCAG GCGACCGCCG AGCTGATGGG TGTCCCCCAG 
CGCCTGATCT TCGGCATCAA GCCCGAAGAG ATCGGCGTCG ACTCCGAGAC CGGCCAGACG 
CTGTTCGATG CGTACCTGGC CCGGATCCTG GCGTTCGAGG ACGCTGAGGG CAAGATCCAG 
CAGTTCTCTG CAGCCGAGCT GGCCAACTTC ACCAACGCGC TCGATCAGAT CGCCAAACAG 
GTCGCTGCGT ACACGGGATT GCCTCCCCAG TACCTGAGTA CCGCCGCAGA CAATCCGGCC 
TCCGCTGAGG CGATCAGGGC CGCTGAGAGC CGACTCATCA AGAAGGTCGA GCGGAAGAAC 
CTGATGTTCG GCGGCGCATG GGAAGAGGCC ATGCGGATCG CCTACCGGAT CATGAAGGGC 
GGCGACGTTC CCCCGGACAT GCTCCGCATG GAGACCGTCT GGCGAGACCC GAGCACTCCC 
ACCTACGCGG CCAAGGCCGA CGCAGCCACG AAGCTGTACG GCAACGGCCA GGGTGTCATC 
CCGCGTGAAC GTGCTCGCAT CGACATGGGC TACTCCGTCA AGGAGCGCGA AGAGATGCGC 
CGATGGGACG AGGAAGAGGC CGCAATGGGT CTCGGCCTGT TGGGCACGAT GGTCGACGCC 
GACCCGACGG TCCCAGGCTC CCCGAGCCCC ACGGCACCGC CGAAGCCACA GCCGGCCATC 
GAGTCGTCTG GTGGTGATGC GTGACCGCAG AGGAGTACGC GGCGGCTCAA GCCGCGATCA 
CTGCGGGTCT TGCCACATAC GTCCAGAGGT TCGCTTCGCT CTTCGTCGGT CCAGCTCTCG 
CTGTAGGTGA GTGGCTGCGA CTGCTGCAGG TGCTGTTCCC CGAAATCCAA CGGCGGTATG 
CAGATGCTGC CGCCTTGGGC AGGGACTTCT ACGACTCCCA ACGCGCACTA CACCACCCAG 
AGCTGCCCCG GAACGAGAGG TTCCGGGGAG AGCTTCGGTG GGAGTGGTTC GTCCAGAACA 
TGGAGCCCGC TCGAAAAGAG ATGTCGCAGG CCGACTCTCC GCCGAGTGCG ACCTCTAAGT 
TGGCTCTGGC CGCAGTTCGC GAAGTGGAGA TGGCAGCACG CCGACAGATC ATCGGCGCTG 
TCAAGAACGA TCCGGCCCCG CAGATCGTGC AGGGCTGGGC GAGGGTCGCC ACCGGGCGCG 
AAACATGCGC CTGGTGTCTG ATGCTCATCT CACGGGGTGC CGAGCTGAAT CACAAGGGCA 
ACTTCGCCTA CAGCTCAGCG GAAGCCGCAG GGCTCAACCT CGATGACGAG ACCGTGATCG 
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104 41 ACCTCTGGAA CGAGTCCGGT CACGACCTTG AGAAGTTCCG CGAGGAGACC AGAGAGGACT 
10501 TCGAGAAGTG GCACGCAGGG TGCGACTGTC TGGTGGTCCC GGTCTTCGAT GTGCAGAACT 
10561 GGCCCGGAAG AGACGCTGCC CTACGGGCGC AGCAACTTTG GATCGAAGCC AGCGACGAAG 
10621 CTGACGACCT CATTGCGTCA GGCAAGGCCC GCTCCAAGAA CAAGAACACG GAGACGCTCA 
10681 ACGCGCTCCG ACGCCGCCTA GCACGCGGCG AAATCACCAT GTCCAACTAC GCCCTCGCTG 
10741 CGTAGTCCCT CGAACCCCAG GTGGGTTCTC TCAACATGCC CAGGAGGCGA AAACACATGT 
10801 CCGACAACCC CACTCCCGAG AGCACCCCAG AGGCCGAGAC CCCGGAGGTC GAGAAGCCGA 
10861 TGGAACCGCA GGGCAAGGTC TTCGATGAAG CGTACGTTCA GTCGCTTCGC CAGGAGGCTG 
10921 CAGCCGCTCG GGTGGCGAAG AAGGACGCCG TAGAAGCGGC AGAGGCTCGA GTGAAGGCCG 
10981 AGTACGAGGC CAAGCTCGCT GAGCGCGACA CCGCTTACAC CGAACTGCAG AACCAGTTGG 
11041 GACAGGCGTG GATTGAGCTG GAGAAGGTCT . ACCTCTCTCT CGACGCCAAG GTGCCCAACG 
11101 ACAAGGTTCG GGCGTTTGTC GAGATCCTCG AAGGCAACGA CAGGGACAGC ATCGCTGAGT 
11161 CAGTGAAGTC CCGTCTGGAG CTGGTCGGCG GATTCGGCAA CAAGACCCCG AGTCCTGCGT 
11221 TCGACCCGTC TCAGGGTCGC GGCGGTAAGC CGCCGATCCC GCTGAACGGT GACCCGATCC 
11281 TCGAGGCCAT CAAGGCCGCT GTCGGGATCA AGAAGTAACC CACCCAACAG ATCTCAAGGA 
113 41 GAGATAAACA ATGGCAGTCA ACCCTGACCG CACCACGCCG TTCCTCGGCG TGAACGACCC 
11401 CAAGGTCGCG CAGACCGGCG ACTCGATGTT CGAGGGCTAC CTCGAGCCCG AGCAGGCCCA 
11461 GGACTACTTC GCCGAAGCGG AGAAGATCTC CATCGTCCAG CAGTTCGCCC AGAAGATCCC 
11521 GATGGGCACG ACCGGCCAGA AGATCCCGCA CTGGACCGGC GACGTGAGTG CGTCGTGGAT 
11581 CGGTGAAGGC GACATGAAGC CCATCACCAA GGGCAACATG ACCTCGCAGA CCATCGCCCC 
11641 CCACAAGATC GCGACGATCT TCGTGGCCTC GGCGGAAACC G7CCGTGCGA ■ ACCCGGCCAA 
11701 CTACCTGGGC ACCATGCGGA CCAAGGTCGC GACCGCCTTC GCGATGGCGT TCGACAACGC 
11761 CGCGATCAAC GGCACCGACA GCCCGTTCCC GACCTTCCTA GCGCAGACCA CCAAGGAGGT 
11821 CTCGCTGGTG GACCCGGACG GCACCGGCTC CAACGCCGAC CTCACCGTCT ACGACGCGGT 
11881 CGCCGTCAAC GCCCTGTCGC TGTTGGTCAA TGCCGGCAAG AAGTGGACCC ACACTCTGCT 
11941 GGACGACATC ACCGAGCCGA TCCTCAACGG CGCGAAGGAC AAGAGCGGTC GCCCGCTGTT 
12001 CATCGAGTCG ACCTACACCG AGGAGAACAG CCCGTTCCGC CTCGGTCGGA TTGTGGCCCG 
12061 TCCGACCATC CTGAGCGACC ACGTCGCCTC GGGCACGGTC GTCGGCTACC AGGGTGACTT 
12121 CCGCCAGCTC GTCTGGGGCC AGGTCGGCGG CCTGTCCTTC GACGTGACGG ATCAGGCGAC 
12181 TCTGAACCTG GGCACCCCCC AGGCTCCGAA CTTCGTCTCG CTGTGGCAGC ACAACCTCGT 
12241 CGCAGTCCGA GTCGAGGCCG AGTACGCCTT CCACTGCAAC GACAAGGACG CGTTCGTCAA 
12301 GCTCACGAAC GTGGACGCCA CCGAAGCCTG ATCCAGGCTT GACATCCACC GGGAGGGGGC 
12361 TCCTTCGGGA GCCCTCTCCT GATGTGGAGC AGGAAGGACC ACATGCGAAT CCAGTCCACC 
12421 CTCAACGGCG GTTTCGCCGA GGTTTCCGAG GAGTTCGCCA AGCAGTTGAT CGCCACTGGC 
12481 GGCTGGAAGG TGCCCCGGAA ACCGCGCAAC ACCAAGACCA AGACCGCTCC TGAGGAGCCC 
12541 AAGAACGAGG AGTAACCCGT GGCCTACGCG ACCGCCGAAG ACGTTGTGAC GTTGTGGGCC 
12601 AAGGAGCCTG AGCCCGAAGT GATGGCGCTG ATCGAGCGCC GGCTCCAGCA GATCGAGCGC 
12661 ATGATCAAGC GCCGGATCCC CGACCTGGAC GTGAAAGCCG CTGCGTCGGC GACGTTCCGG 
12721 GCCGATCTGA TCGACATCGA AGCTGATGCT GTTCTGCGCC TCGTGCGTAA CCCGGAGGGC 
12781 TACCTCTCGG AGACCGACGG TGCGTACACC TATCAGCTCC AGGCCGACCT GTCGCAAGGC 
• 12841 AAGCTCACCA TCCTCGATGA GGAGTGGGAG ATCCTCGGGG TCAACTCCCA GAAGCGCATG 
12901 GCGGTCATCG TCCCGAACGT GGTGATGCCG ACGTGAGCGC GAGCGACCGA CACCGCGCCC 
12961 CGATTGTCTA TCCGCCTGGC ACXCAGGCGG TTACGCCGGA TCGGGTCAAC GCGTTTGACT 
13021 GCGATCACGA AGCTGATCCT CCGGTGTGCC GGTGCGTCCA CGACTGGCGC ATCGAGTGGG 
13081 GAAACGTCAA GAAGGCCACC GCCAGATCAC GGTCGGCGGT GCTCTGATGA GCCTCCTCGA 
13141 CACCGGTGCC CGGTACCAGA CCTGCATCGT CTACCCCGAA GAGATGGTCA TCGACTCCGA 
13201 TGGCAACAAG CGGACCAGGC CGTCGAATAC CGGCATCCCG GCCATCGCAC GGTTCCAGGT 
13261 AGCCAACCAG TCTGGTACGT CGGCACGACG TGCTGAGCAG GACAACGAGG GGTTCGAGAC 
13321 CGAGAAGGTC TACCGGATGC GGTTTCCCCG CTCGTTCACC AAGGAGCACG GCATCCTCGG 
13381 GGCCCAGTCC CAGATCGAGT GGCGAGACCA . GCGGTGGGCG CTCTTCGGAG ACGCCACCGT 
13441 CTACGACTCA TCCCCTGCGT TGGCGCGGGT CGACTACACG ATCAAGAGGT ACTGATGGCC 
13501 AAGGTCTACG CGAACGCGAA CAAGGTCGCG GCCCGGTACG TCGAGACGAG GGACGCCGTC 
CGAGACGAGC GGAACAAGGT CACCCGTCGA GCCAAAGCCA ATCTGGCGCG GCAGAACTCG 
ACCACCCGCA TCACCGACGA GGGCTACTTC CCGGCCACCA TCACCGAGCA AGACGGCGAT 
GTCGACTTCC ACACGATCCT CAACGCGCCC AACGCGTTGG CGCTTGAGTT CGGCCACGCG 
CCGTCTGGCT TCTTCGCTGG CACCGACACG AAACCACCGG AGGCCACTTA CATCC7CACC 
HI*} CGAGCCGCCA TCGGCGGCAC CGTCTCATAA GGAGGTCACA TGGCGCG AAT GCCTCGCGTC t 
"25, CAGGCAGTAG CGGCCCCGAT CCTCCGGTCA GACCCCCGAC TGGAGGGAGT GACGGTCACG 
13921 ACATGGGTTC CAGACGTGGA CTTCCGAGAG TTCCCGATGA TCAACCTCCG CCGCATAGGC 
13981 GGCACGAGGA ACCCCAACGC ACCGACGCTG CACACGCTGC CGGTGGTCGA AATGACCGCC 



ty\ iocttti rrr outrtrr 



WO 93/16172 



PCT/US93/00913 



15 



14041 

14101 

14161 

14221 

14281 

14341 

14401 

14461 

14521 

14581 

14641 

14701 

14761 

14821 

14881 

14941 

15001 

15061 

15121 

15181 

15241 

15301 

15361 

15421 

15481 

15541 

15601 

15661 

15721 

15781 

15841 

15901 

15961 

16021 

16081 

16141 

16201 

16261 

16321 

16381 

16441 

16501 

16561 

16621 

16681 

1-6741 

16801 

16861 

16921 

16981 

17041 

17101 

17161 

17221 

17281 

17341 

17401 

17461 

17521 

17581 



TACACCAGAG 
TACGACGCGG 
ACGATGGGCG 
ATCAGGCTCG 
CTGAAAACGA 
GCACCGCTGC 
GGACCGGTGC 
TCGGCTTCGA 
AGATCACCAC 
AGTCGCTGGG 
AGACCGGCCA 
TGGGGCATCA 
ATGACCTGGC 
TCTCCTGGAT 
TTGACAGCCA 
CCCTCCTCCC 
CACCATCGAC 
GTCCGACGAT 
AGCGGTGGTC 
CGACGAGTTG 
GCTGATCGCC 
TATCCGCGCA 
GGAAGCCGCG 
CTCCAGTACT 
AGATTCGTTC 
CGTGGTGGGC 
GACGCCATCC 
AAGCCCAAGG 
CCGGGTTCGT 
AGGGAGGAGG 
CAGCCCGAAC 
GGAGCTTAAG 
CTTCAAGCGC 
CGACGTAACC 
CAGCCGGGGG 
CC7TGGCAAG 
CGTAGCAGCT 
GTCTGCGTTC 
CGCCTCGACG 
GCAGGGACTC 
CCTGCAGAAC 
CCAGGCTCCT 
GGGCCTCGGC 
CGCGAACTCG 
CATGGTCAAC 
GACGCTGGGC 
GGGACAGCTC 
GCTGATGCCG 
CACACAGCTC 
GCTCGGCACG 
TACGTTGATC 
GCTCATGCAG 
CCCGGCGCTG 
GATCATCTCG 
TCCGACCATC 
GGCGCAGGCT 
CGCGCTCATC 
GTCCGAGTGG 
ACTGCCGGGG 
TAAGGATCTC 



ACGGTCTCAT 
TGGAGAACGG 
CCAC7CAGTT 
GCGTCCGCAG 
CGACGCAGTG 
TCCTACGCCG 
TACCGGTTGG 
AGGCGGCGAG 
CGAGGATCCC 
TCTGTACTAC 
GACCAACGAG 
CGCCCACAAG 
TGCGCTGCCC 
CAACGAAGAC 
CCCGGCTGTC 
GCCACTCACA 
GCATTCCGCG 
GTGACCGTCG 
GAGGTGTTCA 
GTCGATGAGT 
ACGAAGCCCA 
GAGCTGTATG 
CCCTCGCCGA 
ACCGGGTAGA 
TGTCCCTGGT 
AGCAGTACCG 
AGGCGGGCAA 
CACCCAAGTC 
TCGCCGCAAT 
AGAGTGCCGA 
ACCAAGGAGT 
GGCGATGTCG 
ATGATGATGC 
GTCGACAAGA 
CTCGGAGATC 
TCGTTCCTGG 
CCGCTGGTCG 
GGAGCCGGCG 
CTGGCCKGA 
ACCCCGGTGT 
GTGGCCTCGG 
GGTCTGCAGC 
CCTGTGCTCG 
TTCGGCACGC 
CGAGTCACGT 
AGCGTCCTCA 
GGCGGTCCGC 
GCGCTGACTT 
GCTCCCATCG 
ATGCTCACCG 
GGCACGACGC 
AGCTTCCAGC 
GCGACGGCCC 
ACGTTGGTTC 
GTCAACCTGG 
CTGGTCAGCG 
GGCGCGCTGG 
GTCAGCAGCT 
"ATGATCCAGT 
6TCCAGGGCC 



CGAGACTGAG 
AACACAAACT 
CAGCTCCCTC 
ACCGAGAACC 
TTGACTGCGG 
GCCTTGCTCA 
ACGAGCGTCG 
TCCGAGGTCA 
ATCGACTACG 
GGCCCCAACG 
AAGGCCGTGC 
GCTGGAGTTC 
GTCCGGTTCA 
CTCTTCAACG 
TACCCCGGAG 
GACCCGCCGA 
AAGAGGTCAA 
AGCTGAAGCC 
AGGAGTTCGC 
ACTCGCTCCA 
AGAAGCTGAT 
CAGCGGTACT 
GCTGATCGAC 
CCTGCGCGAC 
GCTCTGCCTT 
GGGCTG6ACC 
CCACATCCTG 
ATACCCGCGT 
GGTCGTGCGA 
ATAGTGCTGG 
TCCGCCGGGA 
AGATCAACGG 
AGCTCAAGAC 
AGAGCAAGAA 
TAGGCGATGA 
GCCTCACACG 
GCATCGTGGC 
CTGGCGTAGT 
CGCTGGAGAC 
TCCAGCAGCT 
GCCTCGTGAA 
AGATCCAGAA 
CTACCGGCAC 
TCCTGGCXCC 
CCAACGGCGT 
ACCTGTTCAA 
TGTCGACGTT 
CGGTCTCTGG 
TCACGGCGCT 
GAGCCCTCCA 
TGAACACGGC 
AGATCTCCGA 
TCGGCCAGGT 
CGGCGTT CCT 
TCCAGTCGTT 
TTGCTGGCGC 
CGAACCTCAC 
TCTCCAGCGG 
CGGCTCTCGC 
TGATCAACGG 



GAGCTGTACG 
CCCGCAGGGT 
TACCAGGACT 
ACCCTCTAAC 
CGGTCGGCTA 
AGACCATCGA 
GCCACACCAG 
AGGGCTCCTG 
TCACGGTCCT 
CCTCTGAGAC 
TGGTCGTGAT 
GCCGCGACGA 
CCTACCTGGA 
TGCCCGAGGT 
GGGGAGGTTT 
CACTGAAAGG 
GAAGAAGTAC 
GCTGCTGAAG 
GGACATCCCC 
GGTCTGCGAC 
CGCCGCCTTG 
CAACACCTGG 
AAGTTCGGCG 
CTGTTCCGCG 
CCCAAAGACG 
GAGGACCGCT 
CTGCTGGCGA 
CCCGACGACC 
GCGAAGAAGG 
CGTAGAAGTC 
ACTCAAGACC 
TCATCTCGAT 
CGAAGCTGCC 
GGGAGGTCTC 
CGCCGAGAAG 
AGCCGCCTGG 
CGGTCTGCTG 
CGCGCTCGGC 
GGTCAAGGCC 
CGGCCCGATG 
CATGGCCGGG 
CATCCTCACC 
GCAGGCGTTC 
CCTGCAGGAG 
GTTCGAGGGT 
CCGGCTCATG 
CATCAACGGG 
TCTGATCGGC 
CACGCCGGCC 
AGCTCTGGGT 
GCTGCAGGCT 
CGTACTGGTG 
CGCAGGCGCG 
TCAGTTGGTC 
CGCCAACCTG 
GGTGATTCAG 
GGAGATCATC 
AGCCCAGCAG 
CAACCTGATG 
CATCGGCGGG 



AGACCGCGCT 
ATTTGACCTC 
CCTGGCGCAT 
CGAAAGGTAA 
CGTGTACGTC 
CCTCAGCAAG 
CCGAGGCACG 
GCAGAAGAAG 
ACTGCACCAG 
TCCTGGTGTG 
CGAAGACGGC 
CGCGATTGAG 
CCACGAAGAC 
TCCCGAGGGC 
CCTTGGCGGG 
TTCGCCATGA 
GCTGCGGTCC 

CTGGGCCAGA 
GACCTCGAAG 
ATCATCGCCA 
GACGAGGA&C 
AAGCGAGAGA 
GGGCGATCCT 
ACGAGGATCC 
GCGCGTTCTA 
ACGCGCTCGC 
ATCGTGATCC 
TAGAGAAGAC 
CGGCtCGAGA 
GCCCGGATCT 
GAACTCGAGA 
GCGGCCCAGG 
AAGGGCGTTC 
CTCGGAGGTC 
GCGTCGTCTC 
ATAGGCGTAG 
GCCGGTCTGC 
ATGGACGGCA 
GCTGTCTCCT 
CtGACCGCGA 
TCGATCACCG 
AAGACCGGAG 
CTGACGCTGT 
TTCACCAACG 
GCCATGCAAG 
GAGTCCGGTC 
TTCGGAGATC 
AACGTCCTCG 
TTCCAGACGC 
CCGATCCtGA 
CTCCAGCCGA 
ACCAGTCTGG 
GTGCXGCAGC 
CCAAAGGTCG 
ATGCCGGTGG 
GTGGGTGTCT 
TCCAACGTCA 
ATCGCTGCGA 
GCCATCGGCC 
ATGGTCAGCCi 



AGAGGTTCTC 
CATCTTCGAG 
CCAGGGTCTG 
AGCCACATGG 
GGTGCTGCAG 
CCCGAGACCT 
CTCCCTGAGT 
AAGCTCCGCG 
TTCGATGAGC 
TTCGGTGTGA 
GACATGCGCC 
CTGCCCATCG 
GAGCTGCCGT 
TGATCCCAAC 
CCTGGCCTCC 
CAAACGTATT 
TCATCGGCCT 

AGGCCCGCGA 

AGGACGACGA 

AGGCGTTCCG 

CGGATCCCCG 

CGCAACTGGG 

CGCAGACCTG • 

GCTTTCGCCG 

CGCAGAACGT 

GGACATCTAC 

GAAGAAGGCA 

CACACCGAAG 

GAGAAGGGAA 

CGGTCAAGGT 

AGATCGAGCG 

CCAAGGCCGA 

ACGTCCCGGT 

TCCTCGGCGG 

AAGTACAACA 

GCATCGTGGC 

CGTCGCTGCT 

TCAAGGCAGC 

CGACGTTCCA 

TCACCCCCAA 

ACGTGATCAC 

AGTTCTTCAC 

CCAACGCCGG 

GC TTCA ACGA 

GGCTTTCGCA 

TGCAGGCGAT 

TCTTCGTCTC 

GGACGCTGGG 

TGGCGAGCAC 

CTCAGGTCGC 

TGCTGCCGTC 

CCCCGCACAT 

TCGCTCCGAC 

CTGAGCTAGT 

TTCTGCCCCT 

CCATCGGCGG 

TCAAGAAGGT 

AGGCAGCGGA 

TGCAGGCCGG. 
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CAAGGCCAAG GAGCTGGCGT CCAGCGTGGC TGGTGCAGTG AAGGGCTTCC TGGGCATCGA 
GTCCCCGTCG AAGTTGTTCA CCGAGTACGG CCAGTTCACC GCCGAGGGAT TCGGCAACGG 
CATGGAGGCA GGGTTCAAGC CCGTCATCGA ACGGGCCAAG GATCTCGCGG CTGAGCTGTC 
CAGGGCGATG GAGTCGGGCA CCGACCCCTC CGGGATTCTC 



GCTGAAGCAG ATGCTGGCGG CTCTCGAAGA GGAGCGCAAG 
CGGTATCCCC AAGGGAGACA AGGCAGGCCG AGAGGCGCTG 



GCTGGGCTGG ATCAGAATGA 
CGACTCAAGG TCGAGAAGAA 
CAGAACCAGC TCGACCAGAT 
CCAGGCGCAG AAGGACATCC TGTCCTACCA GCGTGACCGC ATCAAGAACG AGTCTGAGTA 
CGGCGACATG GCCGGCGAAG ACCCGTTGGT GAAGGCAGCC TCCGGGCTGA TGAGCGCACC 
GGTCGACTTC GCGAAAGCGA CTGGCAAGCA GTTCCTTTCG GACATCGGCA TCAGCGGAGA 
TGGGTTCATC TCGAAGGCCA TCACCGAGGG CATCCAGTAC ATCTTCCAGA TCGGCTCTGT 
CGATGAGGCG CTGTCGATCA AGGACCGCGA GGAGTCGAAG AACGCGCTGT CCGTCGTTGG 
CCGCTGACTT GACATCCACC AGGAGGTAAG CATTGATCAC CGACACCATC GTTGAACTCG 
AGGGTGTCAA TGGTGAGCGT TTCAACTTGA CGACCGGTGA CCAGGGTGTG TACCTGGCCA 
CAGACGTGGA GGGTTGTTTC TACGACCCTC CCGTCAAGGT CGTTGTTGAA GAGCCGGGGA 
ACTACCCCGG CGCTCGCTAC TTGTCCCACC GAGCCCTGAA GCGAGACATC GTCTTTGGGG 
TCGTCATCCT CAACGACGCG AAGCAGGGGC CGCGCTCCTG GCTGTCGCGA GACTCCGAGT 
GGCGCAAGGC GTGGGCGTTC AACCGCACCT GCAAGCTCTA CGTCACCACC CCGGACTCCG 
GTACCCGCTA CCTGAAGCTG GCGCTGTTCG AGTCCCCCAC CGTCAAGATG GACACCGACC 
CAAGAGGTAA ACCCCTTGAG GTCACGGTGA TGTCGTGCAT CGCGTACGAC CCGTTCTGGT 
ACGAGGACGA CAAGGTCTTC TCGGCCAAGA CCAAGACCGA CACCCGGTTC GACCCGTCGT 
TCTGGACGCC GCCGTGGCCG TGGGAGGAAC TGCCCAAGGA GACGCTGCGG ATCAAGGTCG 
GCCGCGAGCA GGGTGGGCTA AACCCCACCG ACCAGTACAT * CTTCCCGAAG TGGACCGTTC 
CCGGCTCCAC CGAGAAGGTG CCGAACTTCC CCTGGCCGTT CCCCCCGAAC GTCCCGATCC 
CGTGGGAGAC AGGACCGTTC ACTCAGTTCG TCATCCCGGA CTACTCGTTC GAGGATGAGG 
AGTTCCGCAA CCGCCGGCTC AAGACGCCGG GGTTGATCTA CGGCGAGAAC TGCGTCATCG 
ACACCGACCG GCGCGAGGAG CAGATCGCTT CCGAGTCGGG CTCCCCGGTG TGGGCTCGGA 
TGAACGGTGT CCGGTTCCGC AACTCGATCC CGCCCTACAC CGAAGAGGCT GAGTTCGTCA 
TAGACGCATC GGGATGCGCT CCGGGACAGG TAGTTACCCT CCGGCTCACG AGGCCGTGGT 
CGCGCTGCTG GGGGCTAGAG TGAGTGGTCT GACGAGCGTT CGTGAGGCCG AAGATCTCTG 
GCAGAAGATC CAATTGCGGC GCTGCAAGCG CGAGCAGGAA CGGCTCAAGC ATCCCGACGT 
AGAGCTGCGC GATGGCGACT TCCGCCTGCG CGGCCTGGTC GCTGGCGAGC GGGTGCTCGA 
GTGGGAGTTC ATCGAGAACG AGACTGGCAC CTGCACCTTG CAGCTCTCAC TGAGCCATTA 
CCTGGCGAAG TGGGTGATGG ACCACCGGGG TCGAGCAAAG CGCAACGTCA TCATCAACAT 
CGAGAAGCAA GGCGCTCGAT GGACCGGGAT GATGGACCAC TACCGGGTCA TCAAGACCGA 
CGCAGGGGAC GCCTACATCG AGATCGTGTT TTTGCACGAC TTCGAGCAGA CCAAGCATAT 
CCGGGTATGG TGCAACCCGT TCCTACGCCC CGAGCTGCAG TTCCCCAAGG TGTGGATCAT 
CTTCGGGCCG GCCAAGTGGT GTTTGCTGGT GACACTGTTC GTCAACCTGC TCAGGCTCGA 
GACGAGCTTG TGGACGCTGC CTGATGACCC CACGGACATC AACGAGTGGA TGGGTCCGAG 
CTTCAACCCA GCAAATTGGC GGAACATCGT CAAGCCGTTC CCGTTCCTGG CCGACAACTC 
ACCGGTCACG ATGGTGTTCA GCCGGTTCGG GACGTTCTAC GACACCGCCA AGAAGATCCT 
CdAAGACCAT CAGCTCACGC TGACGTGTCG TCGGTACATC AAGGACCGCG ACCCGCATCC 
GTTCGAAGAT CTCAAGGGGC TCTGGGGAAT TGATCCTGTC GAAGACCTGC TGCAGAAGAT 
CCCGCTCCGG GACGGCTGCG TGGTCTGGGA CATCGAGGAC AACTCAGGTT GGGGCACTCA 
GACCGCGTTC GGCGGTTCGT GGCTGACCGG GTTCGTCCGA GGGATGGTCC AACTGGCCGG 

cgacggccag gtcgagggcg tcgatgtgtt caccggggac tacacgttcc caggcgagta 
ctactccccc tggttcatgg gcaccagccc gatagcaccc cacgtcgtgt tcgaagaagg 
accgctgacc gggatcaagt cgtcggagtt ctcgtactac gaggccaccg acaccagctt 
cctggctggt ggacagagcg cacctggcat caacgagggc atctcggccc tggtgaacat 
cggtggcgac ctgctgacct cgttcatcaa cagccagctc gccgcgctcg gcgcggtcgg 
tggagcgatt gacctcccgc ctctgggcgg tctgctcgat gcggtgttgc agcctctgta 
ctccgatgtg ttcggcgcgt tcatggaagt tccgactctg cgtgcgatgg gcatctcgct 
cccgatctcc gggctcgagg acatcgtcac cggactgggc gacttccact acttcgagaa 
catggccgac ggggcgatga aggcgttcac gctgtcagcg ttcgcagcca tcgcatcgca 
gatccacaag acgagggctc gaacgaccca caccctcaag gtgtctgacg ccgctccgta 
catcttcgcg ccaaagccct acgggcactg ctggatcgga gatcgcgtcg gcacgtcggt 
cctcggctac ccggtcgagc accagttgtt cgtggagcgc atccgcaagg tgaagtaccg 
catcgacaaa gacggcatga agccgttgga .gatcgagatc ggttaccgcg aaccgaagaa 
cccagcacta cacatcctcg aagagatcaa gcgcgtcaac ggcgctcttg gcactgcggg 
;:gattctctaa accgaaaggc acgccgcatg 'attccctcac aagagtctca caatccgaac 

GACCCGCGAC AGCACGTCAT GTGGGCGCTA CGCAATCTCC CGATGATTGC TGGCGTCGGG 



WO 93/16172 



PCT/US93/00913 



17 



21241 GCGATCACGC ATCCGGGTTA CCTGGCGGAT TGGTCAGAGC ACTTGTGGAA GTGCGGCTTT 
21301 CGGCACGTCG ACTGGCTCCG GGAGCTGGCT GATGAGGACG GCAACATCCA CGTCAGTCAG 
21361 CTTCCTGACC AGGAGATCAA GTTTCAGCAG CCCTTCCGGG GCCAGCGAAG CGACTACAAC 
21421 AACGCAGCTC GATGGGTCGG CAAAGACGAT CCTGACCCAG AGCCCGTGCG TATTCCAGAC 
21481 ATTCGCAAGC TCACAGACCA GGAGAACAGA GCGATGATCG CGCAGTACGA ACGAGACGGT 
21541 TGGATCAAGG ATGGATCCC- CGGCCCAGCG ATAGCCGAGG TCGTGGAGTG ACCCCGTTCA 
21601 ACCCAGACTC CATAGGCGAC TACGTGACAC TGCTCGGCGT TGCGTTCCTG ACCTTCTCGG 
21661 TTCCCGCAXG GTTCACCGGA CGAGCACGCA AGCACAGCAG TGACATCGGC GAAAtCAAGG 
21721 AACAGGTATG TAACACCCAC GACACGAACC TGCGCGATGA CCTCGACAGC GTCAAGGCAG 
21781 ACATCAGCGA CTTGAAAGAG ATTGTGTTGC AAGGGTTCCA CCAGGTGAAC GAGTCGATCA 
21841 ACCTCGAGCG CCGTGAGCGG ATCGAAGGAG ACCGCCGAAA GGAGGTTGCG TGACCTACCC 
21901 CACCAACCCA CTAGAGGCCA TCGGCGCTGA CGGCGCATTC GAGATCGGTG GGGGCGACTG 
21961 GAGCTTCGGC CAGGACTACA CCGAACAGGC CATCCGGGCT CTGTTCACGA TGCCAGCGGT 
22021 CACGATGGAG AACGCTCTCG GCCTGCTCGA AGAGCACCTG CTGAAGCTGC CTCTGGAGGC 
22081 GCTGCAGGGC TTCAAAGACA TGATCCCGGA CTGGGTCGAA GGAGCATTCG ACACGGTCAC 
22141 CGGCGCTGTG CAGGCGATCA TGAACGCGCT CCAAGACGGC CCGCTGTTCC TGAAGTTCGC 
22201 CGAGTTCCAG CTCTTCCTGC AGCGTCTGCT GAACAACCCG GCCGAGGTCA TCGGCGAGAT 
22261 CCCCCAGACG TTGATCGACG GCCTACAGGA CGCGCTCAAC ACCGTCAACA ACACCATCCA 
22321 GACCATCGTG GACATGCTCC TGCAGGCGCT GGGCATCACC CCGGAGGGGG AGCTGATCGA 
22381 CCGGATCTTC GACCTGAGCG ATGAGATGGA GTGGCTGCAG ACCGCAGCCT CGAATGCAGC 
22441 TACCGGCATC CAGGACACCT GGAACAAGTT CTGGGGAGCC CTCACCGGGC GCGTCCCAGA 
22501 CCAGGACCAG ACCGTCGCTG AGCCCGCCGA GCGTATCGGC GAGCTGGCCG GCACCACGTC 
22561 TGCTAACTCG TCTGCCATCG CGGAGCTGCA GCGTCGACTG GACAACCAGC AGAACGCTGG 
22621 CGGCGTGGCC GGCGGTGACG ACTTCGAGCG ACTGAACATA TCCGGTTGGG ACATCAGGTA 
22681 TTCCAACGGA TCCAGCGGCC GAGGGTACTA CCGTGCCGAC GGCCACCAAC TGGTCTGGAT 
22741 GGACGAAGGC AACCAGCAGA ACACCGCGAC GTTCGTCCGC ACCAACCCCG CAGACGAGAA 
22801 GACAGCCACC GACTACCAGA AGATGACGTT GGTCOTCGGG ACTATCTCCG GTGAGGTACA 
22861 GACCGTGTTC CCGCCGCAGG GAGGTTCGCA CACCCGGCTA TGGGTCCGCG TCAACGACAA 
22921 CGCTCCGACC GTCGGCATCA CCGACGGCGT GTTCGTAGAG ATCGGCGGCG TATCGAAGGC 
•22981 CCAGATCGGC TACCGCCGCA ACGGCAATGA CACGTTCGTC GGATCTATGG TCGACTGCAC 
23041 CTGGGGTGCT GGATCGATCT TCGCTCTGAC CGCCGGCACG GCCAACGGTG CTGAGAAGTT 
23101 CGAGGTCTCG AAGAACGGCC CCGTGCTGGC CACATGGTCG GACGACGGCG TCGTCTCCGC 
23161 GATGGGTGCG AACTACCGCC GCTGGGGCTG GGAAGGCCAG GCTCGTAACC GCAACCTCGG 
23221 CCAGGGCACT CCGAACTCGG TCACCCGAGT GACGATCACC GACAACGATC CTACCGGCGC 
23281 AGGCGGTGGA GCTGTCAACG TCGGAGGAGA TGTCGTAGGT GTACTCCCCA TAGAGAACGG 
23341 AGGCACCGGA GCTTCGACAG CTTCGGCAGC CCGTACCGCT CTCGGAATCG ATGACCTGGT 
23401 CGAAGATATG TCCGACGTAG TTCGTGGATC CGTCGAAGGA CTCCCGTTGA XACCGAAGAT 
23461 CTGGGTAGGA ACAGAAGCTC AGTACACGGC TCTCGCCACC AAGGATCAGT CCACGCTATA 
23521 CTTCAGGACC GCTTAATGAC TGGTATCTCG TTGGGTGTCA ACGACATCCG CAACCTCTCG 
23581 ATATTCTTAG GCGTCAGCAA CAAGATATTG AAGGTCAGTC TAGGCACAGA AAAGGTCTGG 
23641 CCTGCGTTCA CCCCGGTGCT GACCACGTTC GCCACGGTCG GCACGTACAC CTACAACATC 
23701 CCCGACGGGG CCAAGTTCAT CGACGTCATC CTCCTCGGAG GAGGCGGCGG GGGTAAAGGC 
23761 ATGGCCCTGG CTGACGGCTG GGCCAGAGGT GGAGACGCCG GAAGCTGGGC TATCGTCAC. 
23821 CTCGAACGCG GGGTACACAT CCCGTTGTCG ACCAAGACGA TCACCGGGCT CGTCGGAGCT 
23881 GGAGGCGCAG CGGGAGCTGG CTCTGTATTC TCAGGCAAGG CCGGAGGCCC TGGAGGAAAC 
23941 ACCACGGCGT CCGCTGTCGG ATGGTCAGGT TTGACCGCAA CCGGCGGTCC CGGAGGCTCT 
24001 GTGATCGACA TCCTCAGCGT CGCCGGAAAG TCGCCTGGAG ATCGGACCTA CAACGACCAG 
24061 CTCTACATAG GCGGCGCACA ACAGAACTCA GCTGGCGGGA ACGGCAATGC TCCTGGCGGC 
24121 GGCGGGGCTG GTGCCCAGGT CTCCGCACAG AGCGGCGGTG CTGGCGCTCG CGGCCAGGCG 
24181 TGGTTCTTCG CGTACTGACA AGAAACCCCC CTCTTTAGGA CTCAGTGTCC 
; 24241 GGCTTTTTGC GTTTCAGGAG GTCTTGGCCA GCTTGGACAT CGCCTCAGCG ATAGCCTCGT 
" . - "301 CGCGGGCCTC AGACGCCATC TGGTACTTCA TCGCCATCCT AGGAGTCGTG TGACCGAGAC 
24361 GGGCCATCAG CTCCTTGGTC GTCGCACCTG CCTGAGCGGC GAACGTAGCG CCGACAGCGC 
"«1 ggggS GATGCGGAGT TCCGGCCGAC CGATCTTGGC GTAWCACGC JKJWgR 
24481 TGGTGAACGC GGACTTCGAC AGCCGGTTGC CCTGCGTCGT GGTCACCAGG ^GCCTCGG 
: 24541 GGCCCTTGTT CATCTTCGTA CGGTCCTTCA TGTGCGCTCG GATCATCTCC ^ACGTGAG 
. 24601 GCGGAACCGT CACAGGACGC TTCGACCGGA CGGTCTTGGC GTTGCCAACG ACGATCTTG* 

: 24661 ?HccSacgS GGAAGCGCCA CGGCGCACCC GGAGCTT^T cctcatgccg tcctccacga 

24721 TGTCCTTGCG GCGAAGCTCG ATCAGCTCTC CGAACCGGAG ^CGTCCAC GCCAGGATG 
24781 ATGCCGCGAT CCGGTAGTGC TCGAAGATCT CAGCGGCGAC GATGTCCAGC TCCTCAGGCG 



SUBSTITUTE SHEET 



, WO 93/16172 



PCT/US93/00913 



„..i n«ee ggggg ggSI SSSSS SSSSS 

Will ssss ss S SjSf il-gs SKSK 

25021 TCATCTCTGT GACCGCCACT TCACCTAGCA CCGGGTA6AT ^^^^ CGGGTGTACT 

TGTACAGATC CCTGGTGCCG WTGCGAGGT CGCGCTCCAC gGCCACTTC 
25141 CCTCCAGCGT GATGGCGCTG GCGGCTGCCT Jgrgagg* ceTCC ATCTTGT TCT 

25201 TCTCCATCTC GATOAGCCGC TTCTCGCCCG CGAGCCAGGC GAGGCTTGGA 
25261 CGTAGGTCTG GAG|GCGTAG ^CCTCACAC gTCWg^ ACGTGC CAAC TAGGTCTCCT 
25321 TCCTCCCGCT SfGGfGAGTC "CAGCGATC ggjgggj ^ TTCAACGCGC 
25381 CTCGTCGTOA-AttAfGCTAC CGGGTTGCAA fTCCTei^ CGAGAGGGGG TAAAAACCTA .. 
25441 TTCTACGACCil||AjTTTCT "ggggj. J^gSS COMC1MC TACGCGGCTT 
25501 TCTTGACCGG-GGeSTATGTG ^CG^GAC ACCUAiiv.* * GCCCTCTGGC CATTTTTCTT 
lllSl CC-ATTCCCGT GG€||GCTCC GCTCGTCAGA GgTCTTTTC SCCCICT^ 
25621 TCCAGGGGTC TG^AfiCTCTT GTGCGACTCT Jg^"* GCCCCCCGCC TGCGCGAACA 
25681 TCCCTGATCT GGGSACTTTC «gggg JACGAATAGA gg^^ ^ 
25741 GACGAGGGGC ATTCACACCA GMTGGAGCT «f^^ GCGGGCTGGC GTTCGCCCTG 
25801 GTTGCACCGG GA6TTGCAGC GGTCGGAACC CTCGCCGTCG AGAGATGGTG 
25861 TCGTTCACGG CTCTCAGCGA GCTGGCTGCG £CCA«££» CCACAGTGGC CCTCAAGCAG 
25921 CCCTTGGTGG TCGACGGCCT fMGCTCGTC gCggBTCB "^MCWI QGCCGGCAAC 
25981 AACAGTTGGT ACGCGTGGTC SCTGCTGATC C»RW* TCGCTGCGAT CCCTCCGCTC 
26041 GTGGCACACG CCTACCCCCA CGGCATCATC GCGMGGTGA Jg^ACtC GGAGCACGCC 
26101 TGGCTACTGG CGTCGACCCA CCTAACCGTG JgCTGGCGA JgJBg^ TGCCGCXT g A 
26161 GAAGTACCTG TCTCGCGGCC AGAACCCGCG CCTOWggWj A tGTAGGAGG CACAAAAAAA 
26221 CTGCGCCCGA CCGGGACAGA AATACATAGA G^CCTgge GTTCTGCTTC AGTAGACCTT 
26281 TACCCCCCGA GCCAGCCCGA JMCCMCtt ^SSSS ACGGGCAACC ACATTCCGCA 
26341 GCGAGTCCGA CCCGA(TCTGA TCATCGCCAT GMGACCCAG CTGACAGGCA TGACAGTGGG 
-26401 GGTGATGAGC GAAAGCAACA GGTGCATCGC ggOTTCCTC CTg GAGGTCGATC 
26461 CTGCGGCATC GGAGGAGGCG CGACCGGCTA gGCGAGCCC GATGG GTCGCATCAG 

26521 TTGTTGGGGC GGATACTGAT TGGTCATCCC gggg*^ CCGAGC CGAA CGGCCTTGAG 
26581 CTCCTCGACC GACTCGCGCT CCACGCGGAT gjggggg ACACCGAGGT AGTCAGCGGT 
26641 CCGGCCGTCG GCGATGTAGT TGCGGACGGT CTTGGTecito a ^ cGGTTGTCTC 
$6701 CTCCTGGATG QATGCTCTCG GGGGCATCAG CGCGGTCCTC gTOCTT^ 
26761 CCGAACCCTG GATCACGCCA CGATOTGC GGCTCXe&Rto CACATCACGT 
26821 TGACGGTGCT CAACCAGACA TCGAGCTGGT TGGCXAGCi^ GATGACACCG TCTTTATCCC 
26881 CTCCGAGTTC CGCCTGGAGG TCGTCTCGGT TCTCCTGGGT ggggggj CCCATCGTCA 

' -^sssss-sn |« ill is ssss 

TGGTTTTCCA GOTACGCCTC GGCGAGGTTG GCCTCGGCGC GGTCACGGGC CTCGTC^ 

S«| -.SSSS Sggg ggsg HSSS 

"«i TGGGTCTCCT GGTTGACCGC ^TGACO Jf^gg ggScS KSmCIK 
"541 ATAGAAGG CG GTTCTACCTC CTCTGGCATC CAGTAAGTCT £££££ QGC GGTAATCACA 
2760 1- GCCCACTGCT GGTAGAGGAT GTCGAAGAAC TCGTGGTCCT ££^£U G GTGCAGCAGC 
27721 GATCGTCCTC TTCATCCCAT TCGTCGTAGT JACAgTACA GCCGCAGCAG ggg^ 
""1 CGCACTCGTA GGTGCCGTAG TCGTACTCAT CCCAGTCGTC •gg»JJ^ CG ACC GTTCGCT 
CCTTCATGAT TCGGTCGAAC GCACGCGTCT GCACGCGCAT CICCAe^ GTCGCTGTCT 
Hill TCAACCACGC CCATTCGCCG TCGTGGTTGA J^ggg GCTGGGCTGG AATCCGATAC 
"901 c^c-agg^ CTCGAWGTC AACGTCTGCA GTCCGTXGTT JgJ^jgagg ttqtcgGTCA 
. "961 CGTCCTCAGC GATGTACCAG GGCAACTCCT GGCCGTCGAA ^ TGACGCTGGC 
2|021 CCAGTACTTC AGGGAAGGTO TGCTCGGTCA AgGCGTCCC JJgg^ CG ggGCGTTC 
. CCGGAACTCA AG GAACACCA TGTTGTCCGG GCAGTCCTCO ^^ 6TTGA G CTTGTAGAT 

; "141 ggcggtgtaG ACGCCGATCT CGTTGCCCTC CAGGGTTCCA ^gcAGTACAA CTTTGGCTTG 
lllA CGCCAGACCC ATCAGCTCTT CATCGAGACC GTTCGGTGCT GGCAGTACAA 

1GGCATTAGC CCTCCCTCGG AATTACGTAT GCGCTGAACT CBJU ^ GTACTCGAAG 

;. : ^.SU SS SSSclg SS^S SS ACGCACCGGC GGCGTACACC 
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28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 

30181 

30241 

30301 

30361 

30421 

30481 

30541 

30601 

30661 

30721 

30781 

30841 

30901 

30961 

31021 

31081 

31141 

31201 

31261 

31321. 

31381 

31441 

315J01 

315^1 

31621 

31681 

31741 

31801 

31861 

31921 

31981 



TCCTCGTCGT 
AGGGCACGXC 
CXXXGGCGAX 
GGTGGCGCAC 
TGTCCTCCTT 
XCGGXGCCAA 
GCTCTCTCAG 
AGGTACGGGG 
GACGACGACT 
CGCCGGCCCA 
CTCAGCCGCT 
GGGATCGAGA 
AAGTTGGCGG 
AGGTCCGCAC 
ACGCGAAGCX 
AAGCATCGCC 
GACATGCGCG 
TCCC7GGTGT 

TCCTGAGCGA 
TTGACCTGGC 
CCCTCGGCCA 
GCCTCCCAGA 
TCGAGCAGGA 
GGCACCAGCG 
GTGAAGATCG 
GACCAGATCT 
ACGTTCTCGG 
AACCCGCCGG 
ACGCGGGGCC 
AAGGAGGCGT 
AXGAACACGT 
ACGAGGCGGC 
GCGAGGCAGT 
TCAACGACGT 
CXCXCCXACX 
TGCTGCAXGG 
ATGATCCGGC 
AGCGCCTTGA 
TCCCCTCCGT 
GAGCCTCTTC 
GXAGCAATAC 
CAGCGAGCAG 
TTAGTGGTAG 
GGCGAGATCC 
CTCGGTGAGC 
CGCCTTGCCG 
AAGCACCTCG 
TCCGAGCAAG 
CTCCTCGGAA 
CXXCCTCAXT 
TAGTAGTAGT 
ATGATCAGGA 
TCCXCGAGGG 
GGACATCCGG 
CCCCTGCCAG 
ACCGCCACGC 
CAGCTCGCGC 
CGATGTGTTC 
CGTCTCGGTC 
TCGGATCCAC 



TCGGCCATCC 
CACTTCGXTG 
GAGGGCGTTC 
TCGGCGCTCC 
AGTAATCAGC 
CCAGCACCGG 
CCTCTGAGGC 
TGTATCCGGC 
GGATCATGTA 
TCGCGTTGAC 
CCACACCGGG 
TCCCCACTGC 
TCTTCCCAAC 
CGTTCTCGAA 
CCTGCGCCTG 
GCACGATCCA 
AGGTCCGCGC 
CGAXGAACXX 
TGGCGGCAAG 
GCTTGCCAGT 
GATCCTCGGT 
CCTCCTGATC 
AGCCCTGCCT 
ACCGACTCAC 
TGTCCATCCC 
TGGCCTTGGT 
CCTGGTCCTC 
CCTCGAAGGG 
ACAGACCCTC 
TCTGGAGCAC 
CXCCCAGCXC 
ACTCGAAGGT 
TCAGGXGAGC 
TGACGGTGAC 
GGAAGTACTG 
CCGCTTCGAA 
TGTTGCCGCC 
TCAGTTCCTC 
ACGGCACGTC 
CTCTTCGACT 
TGGGTACTTC 
XXCAXXGACG 
ATCCCCCGGA 
TTCTGCTTCA 
TTCTTGCGGT 
AACAGCTCGT 
TTGAGCCGCT 
TTCGTGATCC 
AGCATCGAGA 
CAGCGCCCCT 
CAACGACCTT 
CACCCTCTTG 
TCACCGCAGT 
GATGTCCTGG 
CTCACGGATC 
CCGATGGTTG 
TGCCTCGCGG 
GTAGACAGCG 
GTCGGGGTGG 
ATATCGCTGA 
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GACTACGGTC 
ATGAGGAACC 
AACTGACCCT 
ATCTCCTTGG 
GCCGTAGAGC 
ACCCATCTGT 
GGGCAGAGAC 
CTCGTGGAGG 
GTTCAGCGCG 
GATGTAGCCG 
GTATGTCGCA 
CTCA GCGAGA 
CTTTCGCGGC 
TGCCTCGATC 
GTAGTCGACT 
GTCCGACGAC 
CXGCAGTGGG 
CXGGACCCAG 
CXCGXXGCCA 
CXCGGXGCGA 
CGAGXXGACC 
GGCCAACCAC 
GXCGAXGXAG 
GXCGGGCACC 
GGCGXACAGC 
CGXCXXGXGC 
GGAGATGAAC 
CCGGGGGXCC 
CA7CXCGAXC 
CAXGCGCXXG 
CACCGGCACC 
GXCGCXGXAG 
CCGGAXGAAG 
GAGGXCXCCC 
AGGCGGAAXC 
CGGACAGXCC 
GAAGXCGXGG 
AACCGXGGTC 
CAAXGGGAXC 
ACCXCGXCXA 
XCXCGGGCXX 
GAGXACGCCA 
CGGXGCGCGA 
XACCGCCCAG 
TCGGCCGGCX 
XCXGCGXCCG 
GGGACAACXX 
GGXAGXCCXX 
CCXXGXAXXG 
CXCXCGGCGG 
GXCCCAGXXG 
GGXGXCXAGG 
CGCXCTGCGX 
XAGGXGXXGG 
XCGGCAXCCG 
CCCGXGACGA 
GCCXGCXXGC 
XCAAXCXCAG 
AGCXXGAACA 
GACACCACCG 



CCGAGGACCG 
GCAXCGGAGG 
XCAGCAGCXX 
CGCGXXCCAG 
GAACCCCACG 
XCXXGCAXCA 
GCGACGAXCX 
CGAATCAGAG 
GAGXAXGXCC 
XXGCGGCCAG 
GAGAACGCCX 
GCCXXCGCCC 
ACCXGGGCTG 
AXGXXGCGGX 
GAGGCCAXCA 
GGCAGCGXCX 
XXGAXGAACG 
GXCXXCCGCC 
XCXXCGACCA 
CCGGXGAXCX 
XXCXCCACGC 
XXCXCGGCGA 
CXGCAGAXCX 
AACGGXGXCA 
AGGXACXCCG 
XCGGCGGCXA 
XTCGCGATCA 
ACCAGCXXCG 
CCGAAGCACX 

AGAGCGCCGA 
ACCCAGGCXX 
AXGTCCAGCC 
XXGCGGAAGC 
XGAACCXCAX 
CAGGXGGCXG 
GGGTCGAXGT 
CXGACCAAGC 
CCGGXAGGGG 
GXGXACCGCT 
CCCGGCGGAA 
CCXGCATCGC 
ACAGCXCXXC 
GAXCGXGGCA 
GXACGCCXGG 
CGGGCCGGXC 
CXGCTXGAXC 
GACAXXGGCC 
GXCCXGGXXC 
CGCCXCXCCC 
AACTGXXCGX 
AAGGXXCGGG 
AXCGGCXCGC 
GCCATGTCAG 
GXGCGAXCXG 
CGGCCXCGXG 
CCAXCGGXGA 
GCGGCAAGCC 
AGCXGACGGA 
GAGCCGGGGG 
AGAAGCXCAA 



TGAACXXCCX 
XGGAGXGAGC 
CXCCXCGXCG 
AXAXXCGGXG 
AACGCXXTCC 
GGXGGCCAAX 
CGXCGXGGAX 
CCCGACAGGT 
GCGAGCXGXC 
CXXCCAXCGC 
CAXGAACXCG 
CACCGCCGXA 
CGXCAGCGGX 
CGCCCGACAG 
CAXCGCCTGG 
GCGCCGGGAX 
XGXGGCAGCG 
A CXX CCCCAG 
GCXXGXCGAG 
XGACGCCCAX 
CGXACTCGGX 
GCGACCGCGA 
CACXGAXCXX 
GGCXCXXGCA 
GGXGGAACAG 
GCXXGGCCAX 
GCXCTTCGAG 
CCAGGAXCXG 
GGXCGAGCAC 

XGGCGAXCCG 
CGXCCXGAGX 
CGGXGGTCXC 
CXXCCAGAXC 
GCCGCAGCXC 
AGGCCAXCTC 
C CGGC XTGTA 
CCXXXGGGGG 
CCXXGCCGXC 
CAACGXCTXX 
XAACXCCGCT 
XACCGCGAXC 
GCGGATCtCT 
GGGXXCACGC 
CGGAXGXCCX 
XCAGGCXTGA 
GCGXACCGAC 
XCACGCACXA 
XCGAXGGCCA 
AGCGCAGCXX 
ACXCGXCXXC 
ACGXGCCGXC 
CAGCCACGAC 
XXCCXCXCAG 
XCGGAGCXGC 
CCAGCGGGCC 
GXXCGXCAXG 
CCGGXCAGCC 
CXCCAXGAXG 
CAGAXGGAXG 
GXGACGGXGA 



CGGCXCCAXC 

AXXGCCXCGG 

CCXGCGGGAA 

GCXGXCAAGX 

GACCXCGGGG 

GXGTGCAGCG 

AGGCAACCGX 

CACGXCCCGC 

CACCGGCAGC 

CAGCXXCXXG 

CTTGGCCACA 

GACCXXCXGA 

CAXCXGGXGG 

CGCCGCCAGG 

CXCAGCGAXG 

GCCGGXGAXC 

GXCCXCAGAG 

CXXCXXAGCC 

CAGAGCCGCG 

CXCCXCAAGC 

GAAAGCGAXT 

GXA CXCCA CA 

GXGCTCEX-AC 

GACCC TCGC g * 

GXCGAXGGXC 

GAGCXXCXXG 

CGAGXGCCCG 

CGXGXCAAGC 

CXGGAGGXCG 

CACGXCCXCG 
ACCGAACXGG 
AGXGXCGACG 
CXCXGGGGXX 
GAXCAAAAXG 
CXTGAXGGCC 
AXGGGXGACG 
CAGCXXCXXC 
AGGCAAXGCC 
GAXCXXCAXC 
AGXXCXGCGG 
CCAAXCAGGG 
XCXCGGGXCA 
CGXAGXXCXC 
XGACCXCGCG 
CCXGAGCCAG 
GGXTCGCTGC 
CCXCGACCXC 
ACCGGXXGXX 
XCAGGXGCXX 
GGXCAXGXAG 
AXCGAACGCG 
GXGGAAGCGG 
XAGCXGXAGG 
CGAAGCAATT 
XXGAXGACGX 
XXCGGCAGGA 
AGCCGGXXGA 
XGGACGAGGX 
CCAAGGXCGQ 
CGCXCCAGCX 
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32041 
32101 
32161 
32221 
32281 
32341 
32401 
32461 
32521 
32581 
32641 
32701 
32761 
32821 
32881 
32941 
33001 
33061 
33121 
33181 
33241 
33301 
. 33361 
33421 
33481 
33541 
33601 
. 33661 
33721 
33781 
33841 
33901 
33961 
34021 
34081 
34141 
34201 
34261 
34321 
34381 
34441 
34501 
34561 
34621 
34681 
34741 
34801 
34861 
34921 
34981 
35041 
35x01 
35161 
35221 
.35281 
35341 
; 35401 
• 3S461 
35521 
35581 



CGGTCAGCAC 
TCTCGTGGCC 
TCGGTCGGTG 
CGAAGTCGCC 
CGATGTCCCG 
TCTCCGCTCA 
CCCCTTGGCT 
TTCTTGTCGG 
GCGTACGGCT 
CAGTACCGCT 
TGGCCACCGC 
ATCTGCTGGA 
TCGGCCGGGT 
AACGTGGCCG 
GAGGCCGTCG 
GGCTGGGCAG 
TGGGGCATGC 
CAGATTCGTA 
GGAAGATCGT 
CGGCCTCGTA 
ACATCTGGTA 
CAACGATCTC 
GGATCGAAAC 
CTGCCAACCT 
AGTACGGCGA 
CTGTGGGAGC 
GTTCGGCCTC 
CTCCGGGTGC 
GACCCACGCC 
CCGAGAACGT 
GGTAGTCGGT 
ACGGCTCCCA 
GCTCACCGAC 
CCTCGGAGAG 
CGACCGCGAA 
CCGAGATGTT 
TCTGGTAGTC 
CAACACACTG 
CTGCGTACTC 
CGCTGGCCAC 
CGTACACCCG 
GCGCAGCAGC 
TGTAGTGCTC 
CACAGACGAT 
TCGAGTAGTT 
GGTCCGAGAT 
TGACCCCCGA 
GAACGAGATC 
CGTCCACGAC 
TGGCGTAGGT 
ACTACTTCCT 
AGAACGAGAC 
TCCTTCCCCG 
* CGATGACCTT 
ACCCCCGACG 
GTGCGGGAAC 
GAACGTCGGA 
GCTCGTAGAG 
CTGTGGGGTC 
TGCTGCCCGG 



CGACCTGCTG 
CAGATCGATG 
GAACGACCGG 
GAAGTAGGGA 
CAACGCACCC 
GAGTTGGTGG 
CGTTACCGGC 
GCCATTGCGC 
TGCCCGTCTT 
TCTCGCCACC 
CGCCGCCGTT 
.CCTTGTCCAT 
CCGCACCCTT 
ACACCCCATC 
TCACCGTCGC 
GGGCGGTGCT 
GCCGTTGGCG 
CTGCTGCTTG 
GGAGCCCT.TG 
GGCGTCTGGA 
GAGCGCCTGG 
CTCGTCCCAA 
CACCTCGGTG 
CCGCAGCTCG 
GAAGATCGGG 
GATGGTTCGC 
TGAGTCGACC 
CTCGGAGTAC 
GATGCGACGG 
CGCCCGGATC 
CTTGCCGGCC 
CGGTTCGAGA 
GTTGGACAGT 
TGCCTTGAGC 
GCtGCCGTAO 
CGTCGTCCAG 
GTCCCAGTGC 
AGCGATGGCG 
CGAGAAGATG 
TCCACCGAAC 
CTGGTAGTGG 
CCAGCCCXCT 
CGACAGAATG 
CTCGACCCGC 
CGCCCCGACT 
CTTCTCGGGC 
GGCCCACAGA 
TTCTCGCTCT 
CCTCTCGACC 
TCGGTTGTAA 
CTCAGTCAGT 
CCCGTACTCG 
GAACATCGGC 
GGTGCCCTTC 
TACAGCTCGA 
AGCXCCGGGA 
TCAAGAATGT 
GTAGCCATCG 
AGAGTGCCCA 
ACCATCCTTG 



GCCTCGATGT 
ATGTGGTTGA 
TAGCAGTTCC 
TCTTCGTCCT 
GGATCGATCT 
AACGAGGTCA 
TTCTCGACCT 
GTCACACTGC 
CTTCGAGACG 
AGGCGCTTCC 
GGCCGGCGCG 
CAGCGCCTTG 
CACGACCACC 
GGTGGAGTGC 
CGACGGCGCG 
CCACGGATCG 
CACTCTTCAT 
GTGATTCGCT 
ATGAGCCCCG 
TGGACGTTGG 
AACGCCAGGA 
CCGAGTTCCT 
TTCGGAGCGA 
GCCATGTCGC 
TGGATCCCCT 
TTCTTCACCG 
TCAGCGGCCA 
CTGCTACCTG 
TTTCGGTCCA 
AGGAATCTCG 
GGCGTCACGA 
GTGATCTCGC 
GACGAGTCCC 
ACTCGGXGGG 
CCCTCCTTGG 
TGCTCGCCCG 
ATCATCGACA 
TGGTCGACCT 
TTGGCGACCT 
GTCTTGAGCT 
ACCGTGCCGG 
CGTGAGTCCT 
CCTAC ATCCT 
AGGGGGTTTA 
CCCCCGCCCT 
CAGCCAGCTA 
TGCCGACCTG 
CCTTCCAACA 
GTTTCCGGCC 
ACGAGTTCAC 
TCGTATCGCT 
ACCGGGCCTG 
CAGGTTCCCT 
TTCATGCCGA 
GATCTTCTTG 
ACACCTCGGC 
TCTGCATGGC 
CGCAGCTCGG 
TCCCCCTGCG 
ACGATGACCT 



AGAACGTCGC 
GGTAGTCCTC 
GGCCCGCGAA 
TGGATTCTTC 
CGGTGGCAGC 
GCCAGGGGGC 
CGGTGGATGT 
TGATCGCGAG 
CCCGACTTGT 
TGAGCTGCCT 
GATCCACCGG 
AACTCGGCGG 
CACGGGTCGC 
TGGACCGCCA 
GTCTCAGGCT 
TCGTAGGACA 
CGACACCGTC 
CGTACGGAGC 
CGAACCTCTT 
CGGTGAACGA 
GCTGGTGGAG 
CGACAGCCTG 
AGAGATCCTT 
TGTTGAGGTT 
CGGAGACTCC 
GGACAGGGAT 
TCTCCCGCAA 
TGAGGGCCAA 
GAACCTCCCG 
TCATCAGACG 
ACGCCGCCAG 
CGCATGGGTT 
ACATCCCCGG 
CTCGCTTCTG 
CCAGACGCCA 
TGCTCGCCTT 
TCCGCGCCGA 
CCATCGCGGC 
TCTGCAGCAT 
TGGCCCCTTG 
GTCGGTAGTG 
CGATGGCGTA 
TCATCGCCTG 
CGACCTCGGG 
CCATCAGGCG 
CCCAGCAGTT 
CCGGCAGCAC 
TATGTCGCCG 
AGGTTTCCTT 
CGGTTGGTCC 
TGAAATAGGC 
CACCACGCAC 
TGGAGGGGTG 
CTTCCGTTCT 
CGACCAGTTC 
CCGGTACAGC 
ACCTCCCTCC 
GGTTCTCGAT 
ACTTTCGGAT 
TGCCCTTGTC 



CGAGGCGTGC 
GTTCTCGGCA 
CTCGGCCAGC 
GAAGTCATCG 
GATCAGTTTG 
AGCGAAGCCC 
CAAGTAGTCG 
GTGCGGTGCA 
AGACCATCTC 
GCGGGGCGCG 
AGCCTGCGTA 
TGTTGACCTT 
TGTACTGACC 
TCGAGTCGCG 
CAGGAGCCGG 
ACTGGTTACC 
TTCGACGGCT 
CTGCGGGAAG 
GAGATCGGCT 



TCGAACACGC 
GTTGCCGGGT 
TCGTCGGCAT 
ACCTCGAATC 
GCTTTCATAC 
TTCTACAGCT 
AGATGACTAC 
GGAGAACAGC 
GCCGTGCTGG 
AGACTGCTGC 
GTGGCCTGCG 
GGCCAGCACG 
GGCGAACTTG 
CACAGCAGCC 
GGCCGGCTCG 
TTTCACTTAA 
TTGGCCGCAG 
CTGGACTCCG 
GCGACATCCT 



CACCGCGTTG 
GGTCAACTCG 
GACCAACGTG 
CTCGATCTCG 
GAACCGCACA 
TGGCATCTTC 
CCTCAGATCA 
GAACTGGGTG 
ATAGGAGGCA 
GCTCTTCGGG 
ATGCGCCCGG 
GTTGATGTGG 
GGTGCAGACC 
CTCTCCGTTG 
CTTGGGCATG 
GAACTCGTCG 
GATGTTGATG 
CCGGCGCACA 
GATGCCGXCG 
CACAGCGAAC 
CGGCCGGATG 
CGTGTCGATC 
GGCACCGGCC 
GTAGTCGACA 
GTAGCCTTCG 
CATGAACGTG 
GAAGAGGTGC 
CTTGAACTTG 
GTCGACAAGA 
CGAGCCGTCA 
CCAAGGGATT 
GTCGGCAGAG 
CTCGCAGGTA 
CTTGGTCTCG 
CCGTAGCCGG 
TCCAGTCGCA 
TCCGAACCGG 
CAAGAACTCG 
GAGCATGATC 
GTCTGGGAAG 
GTCCTTCTCC 



TCAGCCCAGC 
TCGGCTGACT 
TCCTTGGTCG 
TAACCCTCGG 
CGCCGGATGA 
GCCACCGTGC 
TGGGCGAACC 
AACCGCTTAT 
ACTCCGAGAT 
TCGGCCACTT 
ATCAGGTCGA 
CCGAGGTTGC 
ACCCGGTTGG 
CGTACGGCTC . 
TCCXCGCGGG 
TCAACCTCGA 
AACTTGTCGA 
CCGCCGGCCA 
AGCGTGATCC 
GGCAGCGGGC 
CGGCTCACGT 
AGATCGACCA 
CAGTCGTGGC 
TGCTCTGGAT 
AGGTAGTGGT 
AACTGGAAGT 
TGCGCGTTCT 
GTCATCAGAC 
GCAAGATTGC 
GGCTTGGTCC 
TCGTCAGTCA 
TCGCCGCCAG 
ACGACGCCC? 
TCCCGCTGGA 
GAGTGAAGCA 
TCGGCGGCTG 
GCATCCCGTT 
GAGATCGGCG 
GCGr.TGTTCG 
ATAGCGTGCT 
ACGGCAGCCG 
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35641 TGATCGCGAT GATGTTGACG TGCTCGGTCA GCGACTTGTG AGCGCGGAAC AACCGGTTCT 
35701 GCCCGCTCTT ATCCTTCGGG GAGATCCCGT CGGTGTAGCG GCTCCTGATC GCCTCTGCAT 
35761 AGCCCCCGTT CTGAGCGTCC AGAGCCTTCA TCGCCAGCGG GAGGATGTCG ACCAGGTACC 
35821 GATTGGTCGA CTCCCCCTGC AGAGCCTCTT TGACGTTCTC GGACGAGTAG TGGCTGCGCT 
35881 CCTGGAACAA GTCGCGGGCC TTGGCCGCTC CCGACAGGAT GTTGCGAACC TGATTGCGTA 
35941 CGTAGTGAAC TGCCTCACCA CGGTGCAAGC TCTCCAGCGT CTTCTGGATG TACGGGCTCT 
36001 CGAGGTACCA GACCCACAGC TCTTGGATGA TCTCCTCGGC TGTCAGGTTG GTCTCCCAAC 
36061 CGATCAGCGC CTTCCGGGTG GCCCTGCTGA ACAGCTTGCT GATGTCGTCG GTCAAGGCAT 
36121 CACCTTTCGT AGGTACTCCT CCCGGTCCAA TCGGCGGTCG AGGTGTCGAG TGACCTCCTC 
36181 CGCGAAGACC TCGCGGACTT CGCTGGAGGT GATCTGGCGC GAACGTGCGT TCTTGTGCAG 
36241 GTACGGCAGC TTGGTGGCTG TCAAGTTCTA GACCTCCCAG ACTCGGCCGT CGACCGAGAA 
36301 CCGGCCTCCG ACAATCGGAA * CAAGCTCAGG CTTGACGTGC TGGCCGTCGA CCGTCAGCAG 
36361 AGCAAAACCA CTCTGCCAGT TGGCTGTTGC ACCCTTGAGG TACTGAGCTA GCTTCATGTT 
36421 CATCAGGTTG CCGACCTCCA TCGACCACAG CACCTTCTGG TTGCCGCCGT AGCCCAGCGT 
36481 GTGTGGCTTG ATGCCCTGGC GGTGGGTGTG TCCGATGATC ACCGACGTGC CGAACCGCAT 
36541 CATCGCGTTG TACGCGGTGT CAGCGGACTT CTGCGTCACC CGGACCCCAC CACGGTGGCC 
36601 GTGGGTGGAG ATCCAGCCTG GAGCGATCTT GTAGAACTCA GGCAGCACGT CAACACCGAA 
36661 CCCGTCGAAG TCCAGCAGGT TCTGGAACTG GAACGAGCTG ACGTACTCGA CCAGCGCCGG 
36721 GGCGAACTGG TGCAGGTAGT CGACTGGCCG GCGGTCGTGG TTGCCCTCGT GGACACCAAC 
36781 CGGGCCGTCG TAGACCTGGC GCAGCGGCTC CAGGAACCGC CGCTTGCACT GCTCGGAGTC 
36841 GGGCTTGATC CGCTGAGCGA ACTCTTCCTT GGTGCCCTTG GTCCACCGAG ACGGGCTCGG 
36901 GTAGTCCATC AGGTCACCGA TGTGGACGAC CTCGTCAGGC TGGGTGTCCC CGATGTAGCC 
36961 GATGACCGCC TTCAACTGCT TGCGATCATC GAACGGAATC TGGGTGTCCG AGATGACGAC 
37021 GATGCGCTTG CTCACTCAGC GACCTCGGTG AAGGGGCCCC GCATACGTTC CTCGTGGGAG 
37081 CTGGCGTTGC CTCCTGACCA GCGTCGCTTG CCCACCTTGG TGTGGTGCAA CCCGTTGGGG 
37141 TAGTAGATCC ACTTCACTCC TGTGGCGTTG GTGACGGTCT TCACATCGGC AGGAACGTCC 
37201 AGCAAGGTGT CCCACTGGCG AGGCCCCTTG GGATACCGCT CGTCCTCGGG GAGCTGCATC 
37261 TTCTCCAGAA CGCCTGCGTA ACCGGCGATG TCGACCACCG TGTCCTGGTG GTAGCCGTTC 
37321 TCCATGAACC GGGCGATCTT CAGCAGGATC ATCATGACGG CCACGTCCTC CGGGGTGAAC 
37381 TCGACGCCGC GCTTGTACGC GCCCCACAGG GTCGCGATGC GTTCGTGGTT CTCCTTGGCG 
37441 TCCCCOXAGT CCTGGGCTCG CTGTCCGTTG ATGATCTCTT CGGCGGTGGT CAGAATGCTC 
37501 ACAGTCCAGT CTCCGATGCG GTGTAGTAGT CGATCAGCTC ATCGAGCTGG TCCGGTTGAT 
37561 AGCCGAGGAT CGGCTTGTGG GTGlCAGTGA CGACGACGGG AACCGACATC GCGTTGAGCA 
37621 CCTXGGTGAC GTAGTCGTAC GCCTCCGAGT TGGCCGTGAC ATCGACTGCG TCGAAGTCGA 
37681 TCCCGGCAGC CGTCAGCTTG TCTTTGACTC GCTCGCATGG CTTGCAGCCG GGACGGGTGT 
37741 ACACCGTGAC CGGCGCGAAC AGCGTTCTCA CGTGAGCACC ATCCCAGTCG ATGTATCGGT 
37801 CTCCATACAT CAGATCCTTT CCAGCAGAGC AGCTTTGCCC TGCGATGTGA CTAGTGAGTT 
37861 GACATCCTCG CCTTCTGGCA TCGGGATGAT TCGGGCGTTC GGCAGCGTCT TCGCCACCGA 
37921 CCGGGCGAAC TCCATACCGG CGTCGTCGCC GTCGGCCAGG ATGTTCACGT TGCGGTAGCC 
37981 CAGGAACAGC TCTCGGAAGT ACGGCTTCCA CTTCTGGGCT CCGCTGAGCC CCACCGTCGG 
38041 CAGCCCACAC AGCTCGGCGG TGATCGTGTC GAGTTCTCCC TCGCAGATCG CCATGTCCTT 
38101 GCTGTATTTG GTCAGCGCGT AGGTGTTGTA GAGCCGGTCC TTCTCCCCTG GCATCGACAG 
38161 GTACTTCGGT GTGCCACCGT CGATTCGGCG ATACCGGATC GCAGCTACCG TCCAGTGACG 
.38221 CCAGGGCGAC CACCGCATAT ACGGAATCGC CAGGCAGCCC CGGTACATCT CATGTCCAGG 
38281 GAGTGGGTCG TCCACGAATC CCAGACCGAA CCGGCTTAGT TCCGCTCGGC CGGCCAGCCC 
.38341 GCGACTCGCC PJ ATACTCGT CGGCTGGGCT TCCGGGCAGG CTTTCTCTGT ACCGGGACGT 
38401 TGCCTCCCAC AC-.TAGGTTC TCTGCGATTC GCTTAGCCTC TGCAAATGTC ACCTCCTCTT 
38461 CGTGACGAAT GATCGAGATC ACGTCTCCAC GGACCCCGCA GGCCATGCAG TTGTAGCCCT 
38521 GTAGGTCGTA ACTGACTGCG GCAGACGGCG TTTCGTCGCC GTGGAAGGGG CACAGGCACT 
38581 TGTTCCACTC GTGGTGGTCA GGTGGTGGTT CCCAATCCGG GTGGTAGCGA AGAATCGCCC 
38641 TCGCGATGGG CGAGTCGTTC ATTCGTCCTC GTCAAGCTCC TCGGGAGAGA GCCCTTCGAA 
38701 GATCCCGTTC AGGACGGCGG CGAAGCCCTC GCCGGTCTCC GCTGCGTCGA GCATCTCTGC 
38761 AATCGTCTTT GCCATGTTTC CTCCTGGTGG ATGTCAAGTT CGAGACAGCT TGTCAGCCTC 
38821 GACTGGAGCG ATGCGCTCCC CGATGACTTG GACGGCCGGC GGGTTCAGCA GGTACTCGAT 
38881 GGCCCGTTTG AAGAACTCGA TGCAGTCCCT CGCCCAGCCC AGCGTGTACT TGTTGCACAT 
-38941 " CGTGCAGAGC AACCCTCGGA CGATGCCTGT CTTGTGATCG TGGTCGACCG ACAGGCGCTT 
39001 CTTCTTACCG TTGGCTCGCT GGCAGATGTA GCACCGACCA CCTTGGAACT CGTAGATCTG 

„ 39061 - CCAATACTCA TCGCCGGTGA TGCCGTAGGT GGCCAGGATC CGGGTCTCCC AGCTCGTAGA 
39121 GCTGCGAGCC GTCCTGAACT CTCGGTGATG AGTAGCGCAT JCGTGGCCCTG GATACTTGGC 

• 39181 GTCTCGCGTG AGCGGGAGCC CCTGTGCGAC ACAGTCTTTG 1CAAGGCTTCC GCTTGTGCTT 
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39241 
39301 

39361 
39421 
39481 
39541 
39601 
39661 
39721 
39781 
39841 
39901 
39961 
4O021 
40081 
40141 
40201 
40261 
40321 
40381 
40441 

.40501 
40561 
40621 
40681 
40741 
40801 
40861 
40921 
40981 
41041 
41101 
41161 
41221 
41281 
41341 
41401 
41'461 
41521 

' 41581 
41641 
41701 
41761 
41821 
41881 
41941 
42001 
42061' 
42121 

42181 
42241 
42301 
42361 
42421 
42481 
42541 
42601 
42661 
42721 
42781 



ACGGTTCTGC 

CCCGGTTCTC 

CCACCAGAAC 

ACGCCCTTTG 

GCTGGCGATC 

ATGACGGGAA 

CCGAGGGGGA 

ATGCCAGCCC 

CATCCCTCCT 

CCGCCGTGGA 

CAGACCGTGG 

TCGCCTCCCA 

GGGTGGCCAG 

CGCTGATCGG 

GTTCTCCTCG 

GATCACGCGA 

GGTCGGCTTC 

GGGCATAGCG 

CGTAGTGAAG 

GATGTCGGGC 

TGCGCCATCG 

CTTGATCCCG 

GATTGAGCAC 

GATCCGGCGA 

GGCGTCGGAG 

GATGATCTTC 

CGTTGATCTG 

CCGACTTCCC 

COGATACTCG 

CCGA'CAACGG 

CGACGCATGA 

CCGAGAACGG 

CGATCAACGG 

CGTCCAACGA 

CTACCGCGTC 

CGCTGAGGAT 

ACAACGTCGG 

CGGGGCCGGC 

GCGCGGCCCA 

TAGGTGTG7A 

GAGACCAGCC 

GACAGGATCA 

CTCCACTGGG 

AGAACATCTG 

GACACATGCT 

GGGTGGGGCC 

CACCCGAGGG 

ACCGTGTCGT 

TCGACATCAT 

GACGTGAACC 
ACCAGCGCGT 
CGGTCGAGGA 
CGATCAAGGC 
CTCGCTGCAT 
GACCCTTGTA 
GTGACCTCAG 
CAAGTTGGTC 
GTTCGGCTCG 
AACTCGATCA 
GCAGCCTCTG 



ACCCGGTACC 
CATCACCATG 
TTGGCCGCTC 
TCGAGCGGGG 
TCGTAGCCGA 
TGCCGGCCCT 
ACGCCAGACA 
GCTTGCCGTA 
GCTTCATCCC 
CGACCGTGAG 
TGCGGTCCTT 
CTGCAGGCCG 
GAACTGGATC 
GTATCTGACT 
GAGACGTTGC 
GGCTTGCGGG 
GTCTTGATCA 
TTTGGAGTGG 
CCCCCGCAGG 
AGGATCGCCT 
ACCTGCTCCA 
TCCTTGGTCT 
CGGCCCTGGA 
TTGATCTCGA 
GTACACGCGG 
TTCATGTTCG 
CATAGTGTCT 
CCCTCGGTTC 
GTGGAGAGTG 
GATCGGCTTG 
GCCTGTCTCA 
GTCGTCTCCC 
TGGGAAGTCC 
CGGTGATGCC 
CTCGATGTTC 
GGACACCGAA 
CACCTTCGAC 
GCAGACCAGG 
GACCGGGGGT 
CACCTTCCTC 
GGCCCCAGGC 
TCGGCGAATG 
TGTGTCGGTC 
TGTGCCCGCC 
CGAACACCAT 
CATCGTGGTT 
CATGTATCTG 
CCTTGCCGAC 
CCGCCAGGTT 

AGGTGTTGCT 
GCAGCTCCTC 
TTGCGACGAT 
GTCGTGAATC 
CTGGTTCATC 
TCCAGCGCGG 
CGACGGGTGG 
ACAGACCGAA 
ATACCTCTCG 
CGATCTCTCG 
GGGTCGGGAA 



CCGGAGACCT 
CAGAACCACG 
ACTTCACCAT 
TCAGCTCGCG 
GGATCTTGAA 
TTCGGCCTCT 
GATGTCCGCA 
GCGTTCCCAG 
CCAGGCCCAG 
ACCGGAGAAG 
CCAGATCCGA 
TCGTGCGACG 
ATCTGCGCCT 
CCGTATTTCA 
GGGCGAGGCC 
GATCCGGGCT 
GAGCGCCCAG 
TCATCTGGAT 
CATGCGCGCC 
GCGGCTTGAA 
TGAAGTAGGA 
TGCAGGTCAC 
TCTCGAGCAG 
AGTTGTCAGC 
AGAGGCCCAG 
CTACTTTCTG 
CCGACGAACT 
TTGACCGTGG 
AGGATCATCT 
TCGCCGTCGT 
CGGCCCATCT 
TCGCTTGAAT 
TCGTAC\GCG 
TTGTAGTTGA 
TGCTCGCGAA 
CGGGAGAGCT 
TTGAGCGCGT 
ACTAGCTGGC 
AGCGGA7CCC 
CTCGT.GGATG 
GTCGATCCCC 
CGTGTGCCCG 
CTCGCTGGTG 
AGCGTCCCCG 
CCAGTACGCA 
GCCCAGGATC 
CCCCTTGGTG 
GACCGAGTCC 
GCGGATCTCC 

CATGGCTTCC 
TGCCGCGTGG 
CTGGTCGTAG 
GGCCGACCGG 
AGCCGGGTGC 
TGCATCCGTA 
TTTGGATCCC 
CTCTTCCTGG 
CGTCACGAAC 
TCCGGGATGA 
CGGAAACTTC 



CTTCGCCGCC 

ACAGCAGCCC 

TCCTCGAACC 

CTCATCGTCC 

CGACACGTTC 

CGCATGCAGT 

CCGGCCCTGA 

TCGGCTCGGT 

CGGTCTGCGA 

GACCGGTGGT 

GATCCGGTGA 

TGACCAGCTC 

GCTTGTACCC 

CTTGATCCAC 

GGTGAACTCC 

CTCCGGGTCG 

CACCTGCTGG 

CCTTTCCTCG 

CCGCCTGGGG 

GTTGACCTGG 

GACGTTGTCC 

GTCGAGCTTC 

GTACTTGTCC 

GGCCTTGCTG 

GATCGCCGAT 

TTTGGTGGAT 

CCAAGGAAGC 

AGACGTTGAG 

CAGGAACACG 

TGTGCGGGCC 

CGTGTAGGTA 

CGGTGCGGAC 

CGTCATACGC 

ACCGGATCGG 

CAGCCCGCGT . 

GGGTGAACGC 

AGGCGAGGAC 

CTCGTCGGAG 

CCGCCGACCC 

TGATTGACCA 

ACGTGGATCT 

TGGATCAGGA 

TGGTCCCCGA 

TACAGCGGCA 

CCAACCAGCT 
AGCCGTTTGC 

GAGCCAGAGG 
CACGCCTTCG 
ATCAGCCGCT 

TTTCAGAACG 
TCACGCTCGA 
AGGCTGGGCC 
CGCGAGCCGC 
CGCGCAGCTT 
GGACGCAGGC 
AGTTCGTCAT 
TACTGCGGGA 
TCCTGCCCGT 
CGCACGGCCT 
TGCGAGGCGT 



CTCGGCACGC 
TGCCAGGGAG 
CACCAGCGAG 
TCACCGAAGT 
ATAGGCGGTC 
GCCGGGTGCC 
CCATCTCGAT 
GCAGCTCGGG 
TGTCGTCAGC 
ACTCAGTGGC 
TCAGTACTCG 
CGCTTCGTAG 
GAAGGGACAT 
CGCTTCGCGA 
TGGCCGTGGA 
ATCCGCTTGT 
CGCAGTGGGT 
GTGGCTGTCA 
AGAGTTGATC 
TAGAAGTCGG 
GACAGGCCCA 
TTCGACGCGG 
GTGATCCCGT 
ACGTTCTCCG 
CCGGCGATGA 
GTCAAGTTAG 
GAAGTCTTGT 
CATGTCCGGG 
CCCGATCTGA 
GGTGACGTGG 
GXCCATCAGC 
GTTGGtGATG 
GGCCAGAGCG 
GATCTCGTCT 
. AGCTCGTTCG 
ATCAGAGTCG 
GAACGCCGAC 
ATGTGTACCT 
TCGGATGTAG 
GGTCATAGAT 
GTCTCCGGTG 
TCTTGCCATC 
CGTATQGGAA 
CCCGGATACG 
TGTGAGCATC 
GGCCTGGCCG 
AGATGTCACC 
CCAGGGTGGC 
TGTGXCCGAT 

GCGGGCCGTA 
ATCCGCAGCA 
TCACTTCACC 
GTGCGTCTCG 
GAGGATCTTC 
TGTCTCGTGC 
GTCTTCCTCT 
TGAAGTGGCC 
TCCATCTCCG 
CCGCTTGGGC 
ACAGCTCCTG 



GCGTCCTCCT 
ATGTAGAAGG 
ACAGCGCCTT 
CGAACTCGAT 
TCCGAAGTTG 
GACTGAGTTG 
GTTGCGGAGG 
GAGCACGTCC 
GCCGCGAGCG 
CAACGCTTCC 
CCGCATCAGA 
ACGCCGTAGC 
TCGTGGACGC 
TTCGGTCGAC 
. CCTTGGTCTC 
GGGTCCAGAC 
TGGTCTTGCG 
AGTCGGTGTG 
. AGCGCAGTTC 
TCGAGACGTT 
GGAAGTGCTT 
TGTCCGCGTT 
TGAAGAACAC 
ATGCGACGTC 
GTGCGGTGGC 
TGACCGAAGT 
CCCGACGGGT 
CCGAACCCGT 
CCTTTGATGC 
TGGAGCCCGA 
GACTCCAGAC 
TTGTCCACGA 
TTCTCGATCT 
AGTGAGTCAG 
AGCGACCATC 
GCCGAGAAGT 
TTCCCGGTGC 
TTCTGGTCAA 
AGCGATTGTC 
CTCGTCGCGA 
GATGTGTCGG 
GTCACGGAGC 
GTGGCTCAGC 
AGCTGCCGTC 
GCGGTTCATC 
ATCCGAGATC 
TAGGATCCAG 
GTCGTGCTCT 
GTGTAGATCG 

CAGCTCGATC 
GGAATCGTGC 
TTCTTCGGAt 
GCGTCCAAGG 
ATGGTCGCCC 
GGGGCTATAG 
CGGTGGCTGT 
GGCCGTTCAT 
ACCGTCCTCG 
AAACCTGCGT 
GTGCCACTTC 
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42841 UGCTTGTCAG GAATCGGCCC CATTTCCACG TACGTGTAAC CCGCGTCGGG GTCGAGTTCG 

42901 AGCGTTTTCT TGTATTCCTT CGTGCCTGCC TTAGAGGGAA GGTGAGTATC GGTGGCTGTC 

42961 AAGGTGACCT CACTTAAAAA CAGGGCAGCT GTAATTCACA TCACAGAAGC CGCATTTGTC 

43021 AGG7TCAGGC AGAGGCTCGA AGTCACCAGC CTGGATCCGA GCCTCGACCT CATGGAACCT 

43081 CTCGGTGATC CGCTCCCGCG TCCAATCGGT CAGGTCGTAG GGCGCAGTGG GCTTCGCCTT 

43141 GATGCCCTTC TTCCCCGCCA TGAAGTAGTC GCCCGTCTTC GGAGCCTCCA CGTCATAGGT 

43201 CATCGCGACC GCGAGCGCGT ACACGCCGAG CTGGAAGTCG TCACCCGGCG AGTTGCCGGT 

43261 CTTGTAGTCC CGGACTCGAA GCTCACCGTT GACCACGACG ACCGCGTCGA TGAACCCTCG 

43321 GACGCGGATG CCGTCCAGCT CGATGTTGAA CGGAAGCTCG ATGGCCGGCT TGGGCTGTTC 

43381 ACACTCCTTG CAGTTGGTGT CTTTCCACGC CTCCGTAGAG CAGATCCCTC GCCCAGGGGT 

43441 AGTCCAGATC TGCTGGCCCT TGTCCTTCCG CCACGCGATG AACTTCTCTA CCTGCTCCAG 

43501 TCCAAGGTGG AACCGGCGCT CGATGTCACG CTCACCGTTG TACGGCCCGG ACCAAAACCA 

43561 CCACTCGAAG TTCGGGGTTT CGTCGCACAG TGCTCCGATG TCCTTGGCGT ACTCCTCGCG 

43621 GAAGATCTCT TGTGCCCGTT CGAGGCTCAT CTCGCGGCCC TCGGCCAGAG CCTTCTCGTA 

43681 GACCTCAGCG ACGGTGTGAA ACGCGGTGCC CTGCGGCAAC CACGCCGCAG GACGAGCCCA 

43741 TACCTTGTCG ATGCGAGCCA GCTTGTACGC CTGCGGGCAA CGTGTGTATT GGTTCAACTG 

43801 GCTGACGCTT CGCAGCGGCA GCAATGTCTT GGTGTCTGTC ACGCAGCGGC CATCCTTCCC 

43861 TTGCCTATCG TCTCGTTCAG CGCCCCGTCG ACAGCGACAC TGAGCAGTTT TGCGACCTCC 

43921 GACATGTCAA TCGGATCCTT GGGGAATTGG TCAGCCTGAG TCATCCTGAG CACCATCCAC 

43981 TCGGTGCCCT TGTCGCAGTG GATCATGGTC GGATCAAAGC GAGTTCCCCG TGCTACGTAC 

44041 TCGACTTTGT TCGCGGAAAG AATCAAATTC GACACAGGCC GATAAAGTCG TGAGGTGTCT 

44101 TTTACACGAG GACTGCGGTA GACGAGCAGA ACTGAGACTG GGTCTTCGTC CAGTTGGCCC 

44161 TTCCACCACG CCTCACACCT CTGCGCGAAC AGCCACCCTG GATGATCGGC GATGACTTGC. 

44221 GGTGAGGTGT GGACGAGGTT GTCTGCGAAC AGCTTTGCGA GCCGAGTGAG GGGCACGGGG 

44281 TTTCCTTTCG TTGCGCGGCC TGGGTTGGCT CACACAACCG GTCGTGACTT TTAGGGCTCC 

44341 GAGAGAAGCT CCTCGATGTC GTCTGGCCAC GACCAGAGGA GTTCACCCTC GGCGGTGAGG 

44401 TTGGTGTGCT CGTTCACCCG GATCAGGAGA TCGTCATCCT CGATGCCTCG GGGGACGTAC 

44461 CTGAACCCGC CGCCGGCCAT ACCTTCGTAG GGCTCGATGG ATGGGTCGAA CTCGAGCACT 

44521 AAGTCGTCGT CGCGGAGCAT CTTCCACCAC GACAATAGGC GCTTCTTCTT GTCTTCGGAC 

44581 ATCGTGCGGA AGCTACCCAC TCGCATGTAC TCGCCGTGAT CCCGGAGCCT CTGAAAAGCC 

44641 TTCGACTTAT CGTGAGGTTT CCGCGTG7CC CACGGCCAGT TCTGCTGGAC GATCTGCCTG 

44701 GTGGTCAACC GTCCTCCGTA GGTCTTCTTG TGCCACGACA CCGCTTGTCG AGTCACGCCA 

44761 TACAGCTCTO CGATTTCGGT CTGATTAAAC CCCTTCCTGC GAAGATCTTC GATCTCGCTG 

44821 AGAGTGAGTG GTATTCGGCT AGGGGCCGGA ACCACTGCTT TGTGTTGGAT TTTGCCGCTC 

44881 ATGTTTCCCT CCATGAGAAA GGTGCGTGCG TCTCCGCCGA, TTACGGAGAC ATGTTGGTGC 

44941 CTGTCAAGGA TACCCCTAAT TTAGTTGCGT CTGCGGAACC ATATTCAGTT GTGTTCCCCG 

45001 ACGCCGTGGC CGTCTCCCAC TGGGCGTGGG ATCGACTGGC GTTACGCGGT CGTAAATGTA 

45061 GCGGCCTGCC CCACTCGGTA GCAAACCTTG TGACAGGTAT CACTTAGGTC GCCTTCTGT? 

45121 ACACGTTGAC CTCGGGTTTC ATCGTCACGA .CTCTCCTTTC TTAGACAGCC TCAAGATCGT 



45181 
45241 
45301 
45361 
45421 
45481 
45541 
45601 
45661 
45721 
45781 
45841 
45901 
45861 
46021 
46061 
46141 
46201 
46261 
46321 
46381 
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46441 ' CGG7AGAGCA CCAACGCTCC CCCGCCGGCC GA77CCACGG CCTTGTTCTC CTCGGCGGTC 

46501 AGGCGTTCCT 7GACGGCCTG GGCGAAGCCT GCGATCCACG ACCGGCGG7A GCTCTTGAGC 

46561 TGGCCAGCGG TGCTCTTCGG CTTGTACTCC CCGGTGTTGT AGTCGTACTT GTACCGAGGC 

46621 TCGAAAGCCT GCTCCGGGCG GACATTCTCA ACCAGGCGCA TCATCTGCGG CTGCATGATC 

46681 GACCAGAGGA ATTGGAGCCT CTCGATGTGG CGGGGCACGC CGTAGACGTA GATCCGCTGA 

46741 CCGCCCGTGA GGCTGGCGTA CACCGTCTTG CAGTGCAGGG CCTGAGCCAT GCCGTGCAGC 

46801 AACAACGCTT GTGCGGCAAC GTACTTGCCG GTGACGTAGG TGACCCACTG GATGGCGTCG 

46861 GGCAGGTCGG TGGTGTCCAA CCCTTGCTTG CTCGCCTCGA CCTGGGCCAT CTCCAGCCCG 

46921 TACTTGGCCA TCAGCTCGAA CGCTTTCGCC TGGAACACAG CCTCTTCCGG CGTACCGGCC 

46981 ACGTCTTCGG CCTGGCGCAG CAGCTTGGCG ACCTTGTCCT GCATCTTCTT CGTCTTGCCG 

47041 TCGATCATGG TCAGTACTCC TTCTTCCAGT TGTTCCGGTT GCCCTTGCCG GGGCGCTTCA 

.47101 TCTCTCGCTT GCGGTTACGG TGCGGCTGCG CCGCGTTGGA GAGACGCAAC TCGAGCCGTG 

.47161 CCTTGAGCTG GTCGCTCATC TTCTTCACCT CTTCTGGTTC AGCGGATCTG GTCGACGTGG 

47221 ATGCAGCCOA- CGCGGTCTGG CCCGAACTCG GGAGCGAAGC CCAAGACTTC GTCCTCCTCG 

47281 CATGGGAACG CTCGCTGGTC GAACGTGATT GGGTCGGCCG AAGCCTCGTA TGGATCGGCC 

47341 AAGGCCATCG CTCCGACCGC TGTAGCGAAT GCAACGACGA CGGTGATCAG GTGCTTCTTC 

47401 ' ACTCTTCTTC CCTCCACTTT TGGTCTGCGA GAAGCCTTCT GGCGATCTCG ATAGGTTCGA 

47461 TCTCAGGAGT CACTCATCGC CCTCCAAGAT CTTCAGGTTG GCCAGCAGTG CATTGGCCAC 

47521 AGCTCCGATG TGGCCACCGC CCTTACCTCC ACGGCGGGAG TACTCGCGGT TCGCGGCCTG 

47581 CATGAAGTGG AACCTCGGTG AGCCGTCCTC GTGAACCCAC GAGGCTTTCT CGGCGGGCAG 

47641 AGCCCGGTTC ATCTCCACCG ACATCGTGAC GATGATGTGG TCCCTCTGGA GCCGAGCCTC 

47701 GGTCTCGGCG TAGTGGGCAG C77GGA77AC TGCGCCTCGT GTGGTCATGT CTTCTCCTTC 

47761 GGTAGATGTC AAGCTGTCGT CACCACTCTT CGACCGGTAT CGGTTTGTCA CAGCCAGCAA 

47821 GGATCGCGGC GTTGCTGCGG TGATGCCCGT CCCACAGCGT CTTTCGGTCC CTCGAAACCT 

47881 CGAGGGGTTC GAACGGCCAC TCGTTCGATG AGTTGAGGAT GTCCACGACT TC67GGACCT 

47941 TGGCCCAGAA CTTGCCGGTC ACGCCTCCC7 GG7AG77G7A GCGGGGCG7G GTCTGGTAGA 

48001 ACTCTTCGAG CACTGGTCCG C7G7CGGCGA CGGTGCAGTC GACACCAGCG CAGGACATGC 

48061 AGTCGCTGGC GCGGAGCTGG GCAAC77CA7 CGGTGGTCAT GAACGCCGTG GTCACATCGA 

48121 GCCTTTCAGG TGTATGTCAA GCGGCGCGGA CGCCGGAATC GGAGAGGTAG ACGCGGTCAG 

48181 CTCCCAGGAA CGGAGCCTGT GTGTTGGCGT GG ACGAACGT GTCGTTCTCG TAGGGGTTGT 

48241 AGGCGATCTT CGATCCCACG AAGTCTTGCG GGAGAAGCGA GATCAGCTCG CCTACGATGC 

48301 CAGCGTGGAC CACCTTGCGG CGCTCGCGCC GTACCTTGTC GCGGCCGGCC GGCCGAACCA 

48361 CACCCTTGGC GTGGGCCAGC AGGACGTGGC CGCTGCGGTG GATGACTCGA CCCTTGAAGT 

48421 CTCCCTCCAA GGCTTGCACC GAGTACCACG GCTTGCCCTC GCGGTGCGTG CGGTGCAGGT 

48481 TCTTGTAGAC GAAGACTCGG ATCGGCTTGG GAGTCATGAG ACCTCCAGTG TGCGAACGGC 

48541 CTTGTAGGCA CTGATGAGTG ACGCCCCCGA CAGCTCGTTA CCGTGCAGGT GATACCTGTA 

48601 TTTCAGATAC ACGGCTTGGT CGACCGGCTT GTACTCGACC GAAGTGACCT CGACAACCAT 

48661 CCCGTCGATG ATCGCGAAGT CTCCAGCGCG GAGATGGGTG GGGAATTTGA TCTCGGTGTT 

48721 GACTACGGTC ACAGCTTCGA AACCTCCCAG GTACCAACGA ACTTGCCGTT GCGCTTGATG 

.48781 TATCCGCTCT CACCGGGCTC GTACCAATCG ACCTCGAACC CGTAGCGGGC GGCGCAAGCC 

48841 TCGAGGTGGT CGAGCAGGAC GCGGCGACCG GACGCGGTAG CTTCTCCGGT. CAGCCCGCTG 

48901 TCGTTCTTGC GGACGATGAG CTTGAACACT TGGTGCCTAC CCTTCTGCGA TGTCTCGGGA 

48961 GATCTCGGCG AAGACTTTCT TTGCCCACGC CACGCCGTCC CAGGTGATGT CGAACAGTGC 

49021 CTCGTAGAAC TGGTCTCGCA AGGCTTCGTT GCCGTCGGCC AGCGTTGTGA CGAGCCGGTC 
49081 ■ GATGCGGTCC TCGTGGAACT TGTAGACCGA GTGGTTGTAC GGGTCAGCCA TATTGGCGTT 

49141 GGCTCGTTTC ACGTTCTCAA CCACGATGGC TTCGAATAGG TGGTTAACCA GCTCCTCGGT 

49201 CATGTTCTAT CTCTCCTCAG TAGTCGCTGT GCTGGGTCTC GAAGCCTTCG AGGTCACCGA 

49261 CCTCGTCGTC GTACGCGCTC GGGTTGCCGC GCCAGTCGTC GCGGAGCCTT TGACCGCTGG 

,49321 CGTTGTAGCA GGCACCACAG TTCGGGCAGT CCACATCGCT CTGGCGGTAG TAGCGGCAAA 

49381 CCTCGCCGCC GCAGCGTTGG CAGTCCCACG CGCTGTAACC AGGGATCAGG AAACCTTGGT 

49441 CGTCGGTCTG ATCAGGGATG CGTCGGAAGT TCTTGGCAGG CATAGCTACT CCTCATAGAA 

49501 ACTCGTGGTT GATGGCTCGG TGGGCAGCCT CGCGGAAGGT CAGCCCGTCG TCGTACGCGT 

49561 CCCGGTACGT CCAG7CCGCG ATGTC7TGGT AACCAAGACC AAAGGTCTCG GTCATGTAGC 

49621 CGTCCAGCGC GGCCATCCAG GTCTCGAAGC TCATGTCTTC CCTCACTTCT TTGTGGTCGA 

49681 GAACAGCACG TTCCTGCGGC CGTTGACGCA CAGACCGCAA CGGGCACAAG CCGATCCCTT 

49741 GTCGTTGATC AGGTCGATGG CTTTG7TG7T CTCCGGGCAG CGCACCGCCG TCGGAAACTC 

49801 GGCCTTGCCT TTGGCGAACG TGGTGTCGAC GTAGGCGATG TTGATGCCCT TGTCTTCCAA 

49861 GAAGCGCGCC ACGTCGATGT TGTCCGGGTC TGCGCTGAAG TACAGCGCCA GGTTGTCGAG 

49921 CCTCTGCGAG TGCAGGTAGA CAGCCGCCG7 CTGAACCCTT GTGTAGGCCC AGAAC7GGAC 

49981 ATCCGGG77G 7CGCGGA7GA C7CGACCCCA AGCGGCCACA 7AGG7GGGGC TGAAGAAG7C 
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50041 -TCCATCCCAG TGGATGCGGA ACAGCTTCGG AGCCTTGCGA CGGTCGCAAT CCTTGACGAA 
50101 CTCGGCGACC ATCTCGGACA GCAGCGTCAC GGTG7CTGTC AAGTCAGCGT CACGCAACAG 
50161 TTCCCAGTTG TGCAGCAGGA CCGAGCTGAC AGCCTTGCGA ACTTTCTCCA GCTTGCCGGC 
50221 GTAGCACACC TTGGCACAGA AGGCCGTCGC GTCCGGGCAG GAGAAGCCTT GACCGGAGGG 
50281 CAGGCCGATG CTGTTGGCGA TACCTACGGT GGCGTTGCCG CCCTTGGTGA CGTGGACGTA 
50341 GTTGGTGACC TTGCGGTCGT TCGAACGCTT CAGCTTGGCC ATACCTAGCC TTCCTTCGGT 
50401 GGCTGTCAAG TTGTTGGATA CAAAGCGCCC CGAGAGGGAG TCGAACCCTC ACACCGCGAA 
50461 CCGTCGCGGG GCCACCGTGC CTAGTCGATA GAGGTCACTC GACTCTCGTG GACGTAGACC 
50521 ACGGTGTTGC CTACGTTCAC CGCGTAGTAC AGGCCATCGG CACCTCGTAG CTTGTGCCGA 
50581 ACCGTGCCCG ACGTGGCCGT CATGTCTTCG CCCCAGTCGG CGTTAGGTGC CCAGGTGACT 
50641 CGCATGGTGA TCCCTTCAGT AGTCGGTGGC TGTCAAGTCA GCGGATACGG ACGTACCCGT 
50701 TGCCTCGAGC GACGTAGATC TTGCCGTCGA TGTAAACGCG CTGCTGCTGG TTCATAATCC 
50761 TATTCCTTTC GGTGGCTGTC AAGTCTCAGG CCCAGCGACG AGTCGTCGGC CGGGGGCGGC 
50821 GCACCTTGGG CGCGTTGGCT CGCGGTGCCT TACGGATGGC GGTGCCTACC GTGATCTCTT 
50881 CCAACTGGCG TTCAGCCAGG CCGACAGGCC GGGCGTCACC GGGCAGTTCG ATCTTGTAAT 
50941 CGAAGTCAGT CCACCCCTTC AGACCCTTCT CCAGCTCGCG ATCCAACAGA CGCGGAGCCG 
51001 ACAGCTCAGG CGCAACAAAC GGTGTCTTGA CGCTCTCGCG GGCAGTAACC .CGAACCTCAC 
•50161 GGTGCTCAGC GAAGACTGGC ATAGTTCACC CCTTTGGTGG ATGTCAAGCC TGAGCACCAA 
51121 AGCTCAGGCG TAGTGGGTAG TCGGGAATCG AACCCGATAG CTTCATAGCC ACGTTCTACG 

51181 GCTCAGCCAT AGCTCAGCGA TCATTCCATC GCGCCAAGAG CTACCCTCCC GAATGCCGAA 
51241 CCAAAGCTCA GCATTCGTAA GTGTGTATTC TCCCCGTGGC TCAGACAGTA TCTATCAGAA 
51301 CCTAACCACA GGTCTACATT TAGTTATCCG CAGTGCTCGC ACTTTAACGG CATCGAGCTT 
51361 CCGCCGACCC TCAGTCCTCT GGCAGCGAAC TAAAGGTTTG AGTCGGGCTG CGGCCCTTCT 
51421 CGGTCTTGCG TGATTCTCAC TCTACCGGAT GTTTCGGTGG CTGTCAAGCG GGCCGTTTTG 
51481 GTGTTGCAAC GATGCCCTCG TTTAGCGCCG CTGGCGTAAT GCGCTACCCG CCTGATCTCA 
51541 CCGGTCCAAG TTGGTGATGC TTGCAGCTTA CCCGATAACC GGGTGGCTGT CAAACCGGAG 
51601 AATCTTGCCG CCGGATTTTC ACCGGCACCG GCACGATCCT CTCGGATCCG CCTACCGCCT 
51661 TGCTGCTGCG GTCACACAAG AATGCACTAC TGGCCGGGTG GCTGTCAAGC CCT AATCGCA 
51721 AATTGGTGCC CTAGCTGCAG ATATGGCGCG TTCTCGGTGG CTGTAAAGGG CACTACGTGC 
51781 CGCTATCCGC TGGTCACGCT GGACAGTCCC GGCAGCCCGT GCCGCGCATA GGCTGCTCAC 
51841 TACGTGCCCG GTATCGGCGT TGTCGTGCCG CTGTCGTGGT CGTCGCCCCG TCGCTGTCGC 
51901 TGGTCTCGGT GGCATCGCTT GACAGTCGCC CCGCTATCCC CCGTTGCCGC TGGTCAGACG 
51961 CTAATCCGCT TATTTCGCAT AGGCTGCTCA CTATCGCATC GGTATGCGTA TGCGCTGGTC 
52021 ACATATGCGT GTGGTGGTGG TGTGGTGTGC CTGTGTTTGC GCTGGTCAGC CGTGTGCGTA 
52081 CCGTATCCGC ACACTGTGCT TGTGCGTTTG CTGTGTGTCG AGGCCGGCTC TCGCATCGTC 
52141 GCATGTCAGC GCGGGTATGG GCGTGTATCG CACGCTTTGC TAGCCGCGTG CCGCGGCGCT 
52201 CTCGCATCGC ATCGAGTGTT TGCTGTGTCT CTCATCGTCG CAGGTCAGAA GGGGTAGGGG 
52261 GGTTCCCCCT AGGGGTCGGT CCTTGACCGG TCGGTTAT 
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It is known that during the establishment of 
lysogeny, the L5 genome becomes integrated into the 
mycobacterial chromosome and the attachment site 
( att P) . Integration-proficient plasmid vectors have 
5 been constructed which efficiently transform both 
fast-growing and slow-growing mycobacteria through 
stable integration of the plasmid sequences into the 
bacterial chromosomal attachment site ( att B) . 

Because the L5 sequence is now known, and 
10 because L5 has been previously characterized, the use 
of transcriptional promoters with this 

mycobacteriophage may be evaluated efficiently, and 
host synthesis inhibition may also be evaluated 
efficiently. 

15 Figure 1 represents the genome organization of 

the entire L5 genome. DNA analysis has indicated that 
the L5 genome is organized into a right and left arm 
with the attachment site and integrase at the center 
of the genome. The integration functions have been 

20 successfully employed to construct 

integration-proficient vectors for mycobacteria. 

Part of the L5 genome is not essential for 
mycobacteriophage growth. It has been demonstrated, 
that all or most of the gene 62-61-60 can be deleted 

25 without affecting the cycle of the L5 phage. 
Therefore, there is a suitable region in the L5 
mycobacteriophage for the insertion of reporter 
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genes. It is critical that reporter genes be inserted 
into non-essential regions of the mycobacteirophage. 
Otherwise, the raycobacteriophage will be unable to 
survive and replicate. 
5 The L5 mycobacteriophage may have introduced 

therein promoter gene 62 fused to reporter gene lac Z . 
and this reporter mycobacteriophage will be capable of 
rapid diagnosis of mycobacterial - infection and 
accurate assessment of mycobacterial strain drug 

10 susceptibilities. 

Another mycobacteriophage which may be 
successfully used to produce the reporter 
mycobaceriophages is the mycobacteriophage TM4. TM4 
. has been used to construct a first generation reporter 

15 mycobaceriophage, and has the ability to discriminate 
between M. tuberculosis and BCG. A shuttle plasmid 
may be employed with TM4, and may be useful in the 
construction of recombinant and other 

mycobacteriophages . Unlike L5, which is a broad 

20 host-range mycobacteriophage, TM4 is a 

species-specific mycobacteriophage. However, TM4 is 
not as well characterized as the L5 mycobacteriophage, 
and therefore it is more difficult to analyze its 
functions. 

25 DS6A is a mycobaceriophage that has been found 

to be specific for the M, tuberc ulosis complex of 
mycobacteria. It has been shown to inf ct both 
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m , tuberculosis and BCG; It has been demonstrated 
that DS6A can infect over 3,000 different types of 
m. tuberculosis strains. Current efforts are under 
way to develop DS6A shuttle phasraids containing 
5 Firefly lucif erase genes as the reporter molecule. It 
is possible that a combination of different 
mycobacteriophages may be needed to increase 
specificity and then increase the ability to 
distinguish drug susceptibilities. DS6A grows on BCG 
10 and m. • tuberculosis , but does not. grow on M. smegma t is. 

in anticipation of the need for a diverse set 
of mycobacteriophages that can effect a broad or 
limited range of mycobacterial cells, a total of more 
than 50 unique mycobacteriophages have been collected 
15 and isolated by the inventors. 21 new 
mycobacteriophages have been isolated from soil 
samples from India,. France, England, Israel, Tunisia, 
Carville, LA and New York. In addition, another .30 
mycobacteriophages from both the Centers for Disease 
20 Control in Atlanta and the World Health Organization 
Phage Reference Laboratory in Amsterdam were 
collected. The characterization of the nucleic acid 
content of the phage particles of 30 of these 
mycobacteriophages have revealed that all of the 
25 mycobacteriophages contain double stranded DMA whose 
genome sizes range from 45 to lOOkb as sized on pulsed 
field gels. Restriction analysis has : shown that all 
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of these raycobacteriophages are different, except that 
one of the mycobacteriophages from France had a 
considerable similarity to the L5 mycobacteriophage, 
which was originally isolated in Japan, The host 
5 range of the mycobacteriophages varies greatly, some 

being able to infect only m. smaamatis and others 
being able to infect M- smeamatis, BCG and 
M. tuberculosis , but not 3Yiurq . These 

mycobacteriophages may be developed into reporter 
10 mycobacteriophages and cosmid cloning systems, and may 
provide a source of useful transcriptional translation 
initiating sequences, transcriptional terminators, or 
host-range specificity genes. 

In addition, the choice of reporter gene and 
15 its method of expression are critical. It is 

necessary to choose a reporter gene whose product 
would not normally be found in clinical samples, but 
whose product is also easily detectable. 

Luciferase reporter genes have been used in 
20 many diversified biological systems, including 
E. coli . cyanobacteria, phytopathogenic bacteria and 
Bacillus . The presence of luciferase reporter genes 
can be detected by the emission of photons in the 
presence of a substrate, such as luciferin or 
25 decanal. Luciferin and decanal can permeate 
mycobacteria, and ther by allow for th detection of 
gene products, such as photons. Sine one molecule of 
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the luciferase gene product can yield 0.85 photons of 
light, it is the most sensitive biological reporter 
molecule known. The preferred reporter genes of this 
invention are lucif erase reporter genes, such as the 

5 Firefly lux gene (FFlii£) , the vjfrrio f ischeri lux 

genes and the Xenorhabdus lumin escens lux genes, as 
well as the- E. coli B-galactosidase (IgcZ) genes. 
Lucif erase genes, especially the Firetly lux gene, 
geneate a high amount of luminescence activity. They ..... 
10 generate photons, the detection of which is simple and 
sensitive, using commercially available luminometers 
that can detect 100-1000 molecules of luciferase with 
a linear, relationship to enzyme concentration. In 
addition, it is unlikely that clinical samples will 

15 contain significant levels of endogenous luciferase 
activity. 

In choosing transcriptional promoters to be 
introduced into the mycobacteriophages , it is 
desirable to use strong promoters since this will ( 
20 increase the sensitivity of the system. In addition, 

it . is important that the promoter be active following 
- mycobacteriophage infection. The best promoter, 
candidates currently available are the BCG hsp60 
promoter and the L5 gene 62 promoter, which are of 
25 comparable strength. The hsp60 promoter gives good 

levels of luciferase expression from plasmid 
recombinants, but lower levels of luciferase 
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xpression where the mycobacteriophage is TM4. It is 
possible that the reason for this is that the hsp60 
promoter is shut off by the TM4 enzymes following 
infection, thus producing only a modest level of 
lucif erase. The gene 62 promoter may behave in a 
similar manner with the TM4 phage since the gene 62 
product is a good candidate for the L5 repressor and 
is expressed at high levels in the absence of other 
mycobacteriophage functions. Knowing the sequence of 
the mycobacteriophage used will help in identifying, 
characterizing and cloning the appropriate promoter to 
be used in the reporter mycobacteriophages of this 
invention. 

There are several methods which can be 
utilized to introduce the reporter genes and 
transcriptional promoters into mycobacterial 
species-specific mycobacteriophages. One method is 
the utilization of shuttle phasmids. When utilizing 
shuttle phasmid technology, it is necessary to know 
the sequence of the mycobacteriophage so that the 
reporter genes are inserted into non-essential regions 
of the mycobacteriophage. Insertion of reporter genes 
into non-essential regions permits the 

mycobacteriophage to survive and replicate. In order 
to use the shuttle phasmid : methodology, it is 
necessary to first generate a cosmid library of large 
double-stranded " recombinant DNA fragments of 
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mycobacteriophage. This can be done using cosmid 
cloning in e. coli . Next, the cosmid library is 
introduced into the mycobacteria of interest to select 
for cosraids which have been inserted into 
5 non-essential regions of the mycobacteriophage. The 
shuttle phasmids, which consist of the E, coli cosmid, 
the reporter genes and mycobacteriophage promoters, 
may then be characterized. Shuttle phasmids can be 
propagated in E- coli as plasmids, and propagated in 
10 mycobacteria as mycobacteriophages . 

A second method of introducing the reporter 
genes and transcriptional promoters into 
mycobacteriophages is by homologous recombination or 
PCR. First, non-essential regions of a 
15 mycobacteriophage must be determined. Again, in order 
to do this, it is necessary to know the sequence of 
the mycobacteriophage. Consequently, L5 is an • ideal 
: phage to use with this method as its genome has 
already been sequenced and characterized by the 
20 inventors. Next, plasmids are constructed wherein 
reporter genes hooked to transcriptional promoters are 
flanked by mycobacteriophage non-essential region 
sequences in mycobacterial plasmids. Then, homologous 
recombination systems or PCR may be utilized in 
25 m- smggmatis or S^sali to perform gene replacement 
whereby the plasmid constructs, containing the reporter 
genes ar put into mycobacteriophages. . , s . 
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A third method of introducing reporter genes 
and transcriptional promoters into mycobacteriophages 
is by use of transposons. For example, transposon 
IS1096 may be utilized* In order to use this 
5 methodology, reporter genes and transcriptional 
promoters are put into transposons, and the 
transposons containing the reporter genes and 
transcriptional promoters are delivered on plasmids in 
mycobacteria. Next, it is necessary to grow up the 
10 mycobacteriophages on a strain such as yi. smeqfliait i? * 
which strain contains the transposons. At certain 
frequencies, the t: nsposons will hop into 
non-essential regions of the mycobacteriophages, 
thereby introducing themselves therein. The 
15 mycobacteriophages are still viable, and contain the 
reporter genes and transcriptional promoters. 

A fourth method of introducing reporter genes 
and transcriptional promoters into mycobacteriophages 
is by debilitated phages packaged into phage heads and 
20 tails (phage particles). To utilize this methodology, 
it is necessary to develop helper phage systems which 
allow for pieces of DNA containing pac sites to be 
packaged. These helper phages allow for the synthesis 
of head and tail genes at will in mycobacteria, 
25 prevent themselves from being packaged into phage 
heads and tails, and facilitate packaging of pacmids 
into phag heads and tails. Helper phage systems may 
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be generated from the L5 mycobacteriophage . The 
genome of the helper phage is put into the 
mycobacterial chromosome, at which time the 
mycobacteria are grown up. Next, pacmids which 
5 comprise phages which have pac sites, reporter genes, 

transcriptional promoters and mycobacterial replicons 
are transformed onto the mycobacterial strain. The 
production of head and tail proteins may be induced, 
for example, through an increase in temperature, and 
10 the pacmids are then packaged into phage heads and 

tails. The L5 genome has cohesive (cos) termini. 
This suggests the possibility of constructing L5 
cosmid vectors, which could be packaged through the 
cos sites into L5 particles either in vivo, or in 
15 vitro . Then, a . large number, of- genes could be easily 
and efficiently delivered to mycobacteria. 

Packaging into phage heads and tails may also 
be utilized in a fifth methodology wherein the pacmid 
is a-plasmid. The methodology is similar to the 
20 methodology wherein a debilitated phage is used, 
however, instead of using phage pacmids, the pacmids 
comprise plasmids which have pac sites, reporter 
genes, transcriptional promoters, and plasmid 
replicons. 

25 Finally, direct cloning using recombinant UNA 

techniques in vitro may be used to introduce reporter 
genes and transcriptional promoters into. 
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mycobacteriophages . This methodology consists of 
ligating a mycobacteriophage/ identifying or 
introducing unique restriction enzyme sites in 
non-essential regions of the mycobacteriophage, 
5 cleaving the mycobacteriophage with the restriction 

enzyme sites, and cleaving DNA which encodes the 
promoter and the reporter gene so that it has the 
unique sites flanking it on either side. Next, 
ligation is set up in vitro between the cleaved 

10 mycobacteriophage with the unique restriction enzyme 
sites and the reporter gene cassette. The result is a 
circular DNA molecule which consists of the 
mycobacteriophage, the reporter genes and the 
transcriptional promoters. The circular DNA may then 

15 be electroporated directly into mycobacteria . 

EXAMPLES 

Expression of Reporter Gene 
lacZ and FFlux in Mycobacteria 

A promoter probe vector was constructed which 

20 incorporated a truncated E. coli B-galactosidase 

( lac Z) gene as a reporter probe into a shuttle plasmid 

vector that replicated in either mycobacteria or 

E. coli . Random DNA fragments from the three 

mycobacteriophages LI, TM4 and Bxbl were cloned into a 

25 unique BamHl site immediately upstream of the lacZ 

g ne and screen d for their ability to produce 

B-galactosidase. This established that lfi£Z could be 
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used as a reporter gene in the mycobacteria, and 
identified the DNA sequences which could effectively 
express foreign genes in both M , smeqmatis and 
fn, +iiherculosis . B-galactosidase activity could be 
5 detected from lysed cells using OMPG, or from unlysed 
cells using either X-gal or a fluorescent 
methylumbelliferyl B-galactosidase derivative. The 
promoter hsp60 gene highly expressed the iac.Z gene in 
both M. smeamatis and BCG. 
10 The FF lux gene was cloned into pMV261 

downstream from the hsp60 promoter in plasmid pYUB180 
(see Figure 2), which plasmid was shown to express the 
FFlux gene in M. smeQmatis , BCG and tuberculosis 
H37Ra. The expression of the FFlux gene was detected 
15 by observing luminescence of mycobacterial clones 
containing the cloned gene in the dark room, and 
verified use in photographic film. This demonstrated 
that the lucif erase was expressed in the mycobacteria, 
'. and that luciferin, the substrate used, was able to 
20 penetrate mycobacterial' cell walls and yield photons 
expressed by the mycobacteria. 

Detection of Photons In 
Mvnnhacte rial Cells Fynressinq FFlu* 

The expression of FFlux from the plasmid 
25 r vrmifln in ft. smeamatis provided a model with which to 
determine a minimal number _ of individual cells 
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detectable with the luciferase assay. M. smeamatis 
containing pYUB180 were grown in the presence of 
kanamycin to ensure that every cell contained the 
plasmid. The cells were diluted 10-fold serially and 

5 the amount of luciferase activity was determined using 
a luminometer . Figure 3 shows that the amount of 
luciferase activity from 5 X 10 7 cells approached 
10 8 luciferase units, though at this level of 
activity the luminometer was unable to yield an 

10 1 accurate measurement. However, the activity decreased 
in a linear manner down to 1200 units for 500 cells. 
Hence, 5000 cells expressing the FFlux gene can be 
clearly discerned above the background measurement, 
which approaches the number of cells that one would 

15 expect to observe in clinical samples. 

Distinguishing Drug-Resistant Mycobacteria From 
Drug-Sensitive Mycobacteria Using Lucife rase Activity 

Since Firefly luciferase activity requires 

ATP, and ATP is produced only by living cells which 

20 are metabolically active, luciferase is a powerful 
indicator of the metabolic abilities of a bacterial 
cell. Since anti-tuberculosis drugs are likely to 
significantly decrease the metabolic activity of a 
cell, the measurement of luciferase activity should 

25 provide a sensitive means of distinguishing 
drug-resistant mycobacteria from drug-sensitive 
mycobacteria. 
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First, the kinetics of the production of 
luciferase activity of m. smpamatis containing pYUB180 
following the addition of streptomycin, isoniazid, 
ethambutol, rifampicin, ciprofloxacin, novobiocin or 
5 cyanide, added at levels that inhibit the growth of 
M. smeomatis in plate assays, was measured. 

As shown in Figure 4, Panel A,- the levels of 
luciferase production were 100 to 1000 times less at 
eight hours after the addition of the drugs compared 
10 to the untreated control. 

Next, this approach was used to distinguish 
drug-resistant from drug-sensitive mycobacteria. The 
pYUB180 deposit was transformed into 

streptomycin-resistant or novobiocin-resistant 

15 m. smeomatis mutants. Photon production by the 
drug-sensitive parent was compared to the 
streptomycin-resistant or novobiocin-resistant 

. mutants. The drug-resistant mutants continued to 
produce luciferase activity levels comparable to the 
20 untreated patent in the presence of the appropriate 
antibiotic. In addition, the drug-resistant mutants 
produced 100 to 1000 times more luciferase activity 
than the drug-sensitive parent (see Figure 4, Panels B 
and C). Hence, a lucif erase-based assay may be used 
25 to determine mycobacterial drug susceptibility. 
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Construction of TM4 Reporter Mycobacteriophages and 
Detection of Photons Fo l lowin g TM4 ; ; lux Infection 

The first vectors developed to introduce 
recombinant DNA into mycobacteria were shuttle phasmid 
5 phage vectors. Shuttle phasmids have the ability to 
replicate in E. coli as cosmids and then replicate in 
mycobacteria as phages. Shuttle phasmids of TM4 which 
contained the FFIms and lacZ genes transcribed from 
hsp60 and Ll promoters, respectively , were constructed 
10 (see Figure 5) . 

A deposit of the shuttle phasmid (reporter 
mycobaceriophage) phAE39 which contains 

mycobacteriophage TM4, cosmid pYUB216, reporter gene 
FFJjux and promoter hsp60, was made with the American 
15 Type Culture Collection on January 12, 1992 and 
catalogued as ATCC #75183. When the TM4::Ius shuttle 
phasmid phAE39 was mixed with M. smegma tiff cells, 
lucif erase activity could be detected within 15 
minutes of incubation, and continued to increase 
20 slightly over the next 4 hours (see Figure 6). These 
results show that the TM4 : mycobacteriophage is 

capable of introducing the FFlus . gene into 
mycobacterial cells, and that the FFiux gene can be 
expressed in mycobacteriophage-inf ected cells- Figure 
25 7 represents a flow chart for cloning different 
promoters into the TM4::inx shuttle phasmid phAE39. 

A deposit of the shuttle phasmid (reporter 
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rnycobaceriophage) phAE37 which contains 

mycobacteriophage TM4 , cosmid pYUB216, reporter gene 
lacZ and promoter LI, was made with the American Type 

Culture Collection on > 3 -992 and 

catalogued as ATCC #_ The TM4::lac_Z 

mycobacteriophage formed bright blue plagues when 
plated on media containing X-gal. 

rnnst-riirtion of the L5 Sorter MvcobacterQphaqe 

Strategies for construction of the recombinant 
L5 mycobacteriophage may be investigated. The 
possibility of using the shuttle phasmid approach 
starting with L5 deletion derivatives, in which the 
size of the genome . has been reduced, may also be 
explored. Initially, the largest gene 62 deletion 
15 available should be used. However, other deletion 
derivatives in which more of the gene 62-61-60 segment 
is lost should also be isolated. Another approach 
would be to attempt to introduce genes by homologous 
recombination with plasmids. Still another approach 
would be. to transpose lux genes onto L5 using either 
the mini-Mu in vitro transposition system or a 
mycobacterial transposon" such as IS1096. 

Recombining reporter genes from recombinant 
plasmids onto L5 using a double recombination event 
25 may also be performed. This involves first 
constructing a recombinant plasmid that carries a 
reporter gen (lacZ may be more ' suitable) inserted 
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into gene 62 such that both the upstream and 
downstream parts of gene 62 are present. Advantages 
of this approach are that lac Z can be easily detected 
in agar media, that gene 62 is not an essential gene, 

5 and that lac Z is efficiently expressed from a promoter 
immediately upstream of gene 62. An L5 
mycobacteriophage lysate may be prepared by growth of 
the plasmid-containing strain and recombinant 
mycobacteriophage progeny identified by plating the 

10 lysate on wild-type M. smeomatis for individual 
plaques on agar containing the indicator X-gal. 

This recombination approach may be expanded to 
introduce other gene or DNA segments of the L5 
genome. For example, it should be possible to add 

15 luciferase genes from FF lux in an identical manner, 
provided that packaging limits are not exceeded. In 
addition, inclusion of polylinker containing 
restriction ' enzyme sites unique for L5 would open the 
way for construction of L5 recombinants in vitro . 

20 Similar genetic strategies may be used to 
systematically reduce the size of the L5 genome by 
deletion of non-essential sequences. 

Transposition offers an alternative method for 
the construction of reporter mycobacteriophages . A 

25 transposition system which is available is the mini-Mu 
- ill iXltxsi transposition system. This is a defined 
biochemical reaction in which a mini-Mu transposon 
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carrying the desired gene is transposed onto the phage 

genome using purified MuA and MuB proteins. Similar 

transposition experiments have been tried with L5, but 

few L5 mini-Mu derivatives have been isolated. It is 

5 possible that this is due to the relatively large size 

of the transposon used. It is necessary to first 

construct a small Mu transposon which contains the 

reporter gene, a promoter and the two Mu in order for 

these experiments to be successful. 

10 Development of L5 in vivo 

and in vitro Packaging Systems 

g cosmids and packaging systems provide the 

efficiency of mycobacteriophage infection with the 

ability to inject large segments of 

15 non-mycobacteriophage DNA. Analogous mycobacterial 

systems would overcome packaging constraints 

encountered with recombinant mycobacteriophage genomes. 

and allow the introduction of multiple copies or types 

of reporter genes into mycobacteria, potentially 

20 enhancing the sensitivity of the assay. In addition, 

they would help overcome any problems with host 

synthesis inhibition. 

The development of L5 cosmids and packaging 

systems is dependent on the finding that the L5 genome 

.25 contains cohesive termini. The g paradigm suggests 

. that a relatively small region of DNA (approximately 

500bp) around the site (in the ligated form) is 
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necessary to promote packaging. The first series of 
experiments with L5 would therefore be to identify the 
segment of the genome required for packaging by 
constructing a series of plasmids containing the L5 
cos site and surrounding sequences. Cos activity may 
be determined by preparation of an L5 lysate on 
plasmid-containing M. smeamatis strains, followed by 
the identification of antibiotic-resistant 

transductants in the lysate, by transduction of 
M. smeamatis . This assay assumes that plasmid 
multimers of a total size of approximately 50kb are 
present in the cell and will be packaged. Although 
the presence of such multimers has not been 
demonstrated directly, they are likely to be generated 
by the homologous recombination system of 
M. smeamatis . If this assay should fail, cosmid 
vectors which contain both L5 g cos sites may be 
constructed- Insertion of 40-45kb of DNA (as in the 
construction of cosmid libraries) followed by g. 
packaging In vitro and infection with F- QQli will 
generate 50kb sized molecules containing L5 COS site. 
These should be isolated from E. coli and introduced 
by electroporation into M. smeamatis. Assuming that 
one of these approaches is successful, it would then 
be possible to define a small segment of L5 DNA 
required for packaging. 

The construction of frn vivo cosmid packaging 
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systems is a particularly attractive idea since it has 
proven very useful in E. coli . Thermo inducible 
lysogens of L5 may be suitable for in vivo packaging 
of L5 cosmids without further modification, since 
5 prophage excision may be a temperature-sensitive 
event. Efficient packaging of extrachromosomal 
cosmids present in the lysogen may be achieved by 
simple induction and growth at 42°C. 

It is possible that some process other than 
10 excision is temperature-sensitive in lysogen 
induction. If so, it will be necessary to further 
debilitate the prophage in order to prevent DNA 
packaging of the prophage. There are a variety of 
ways to accomplish this. For example, the excise gene 
15 itself could , be deleted . (using a. recombination 
strategy similar to that described above) such as to 
prevent excision. Another approach is to damage the 
cohesive termini (by exonucleo lytic digestion) of an 
L5 _ thermoinducible derivative and construct a 
20 defective lysogen. A combination of approaches may be 
desirable, since even if prophage excision is a 
temperature-sensitive process, the destruction of qqs 
might effectively reduce the background of spontaneous 
mycobacteriophage release. 
25 Construction of in vitro packaging systems 

will follow similar lines. Extracts may be prepared 
from ... thermoinducible strains with non-packagable 
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prophages and assessed for their ability to package 
exogenous ly added L5 cosmid or mycobacteriophage DNA. 
Optimization of conditions should follow both 
empirical biochemical approaches and the 

5 well-established g systems. For example/ it may be 
necessary to supplement the extracts with purified 
mycobacteriophage products such as the terminase or 
the tape-measure analogues (genes A/Nu and H of g 
respectively)/ neither of which have yet been 

10 identified. 

Construction of Novel Shuttle 
Phasmids From Anv Mycobacteriophage 

Although mycobacteriophages L5 and TM4 can be 

used in the development of diagnostic luciferase and 

15 B-galactosidase shuttle phasmids, there may be other 

mycobacteriophages, such as the mycobacteriophage DS6A 

which only infects BCG and M. tuberculosis strains, 

that might prove to have a more useful host range for 

clinical isolates. Diagnostic luciferase 

* 20 mycobacteriophages from these other mycobacteriophages 

raay^ be developed by using the shuttle phasmid 
methodology described herein that has been proven 
successful for constructing mycobacteriophage vectors 
from both TM4 and phage LI. 

* 25 Isolate Mycobacteriophage L5 and TM4 Mutants to 

Infect the Maximu m Number o f Clinial Isolates 

For :: th diagnostic luciferase mycobacteriophage 
system to have maximal use in' the clinical laboratory, 
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it will be essential that to develop a set of 
diagnostic mycobacteriophages that can efficiently 
infect any clinical isolate and possibly distinguish 
M. tuberculosis from M. avium and BCG. Both 
5 mycobacteriophages TM4 and L5 appear to have the 
ability to infect a large number of M t tuberculosis 
isolates, TM4 is very closely related to phage 33D, a 
mycobacteriophage that has been found not to infect 
every M. tuberculosis isolate used to define the 
10 mycobacteriophage typing schemes, for M- tut>?rg\ilQ3is 
. isolates." However, this mycobacteriophage does not 
infect BCG. TM4 has been found to be almost identical 
by DNA hybridization and restriction analysis to 33D, 
and it shares the host-specif icity with . 33D in that it 
15 infects M. tuberculosis , but fails to infect BCG. 
mycobacteriophage L5 appears to share the same 
receptor as mycobacteriophage D29 which receptor -has 
been previously shown to infect a very large number of 
M . tuberculosis isolates. L5, unlike 33D or TM4, 
20 infects all three mdrphotypes of M. gvium including a 
wide range of serovariants . 

If L5 or TM4 are found not to infect certain 
M. tuberculosis isolates, it may be possible to 
isolate mutants of these mycobacteriophages which 
25 plaque on the particular isolate-. The inability to 
plaque on a particular isolate could result from the 
lack of a mycobacteriophage , receptor or be the result 
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of lysogenization of the isolate with a homoimmune 

phage. Phage mutants with altered host range 

specificities or mutants which no longer bind a 

repressor (equivalent to virulent mutant of g) have 

been isolated in other systems. Variants of TM4 which 

can efficiently infect BCG have been isolated at 

frequencies of 10 . Previous work has demonstrated 

that 33D, similarly to TM4 , can not adsorb to BCG 

cells. Host-range variants of TM4 which not only 

plaque BCG, but also still plaque M. tuberculosis have 

been isolated. Similar strategies for M. tuberculosis 

isolates which are uninfected by L5 or TM4 may be used. 

Detecting the Presence of 
M. tuberculosis in Cli ucal Samples 

The combined sensitivities of luciferase and 

raycobacteriophage infections should permit the 

detection of previously undetectable levels of 

M. tuberculosis cells in sputum, blood samples, or 

cerebral spinal fluid. A number of preliminary 

studies to optimize the detection of M. tuberculosis 

cells in a variety of body samples will be performed. 

Detecting M. tuberculosis Grown In Primary 
Human Macrophages and Macrophage CeU hinss 

As a model system for optimizing detection of 

M. tuberculosis in infected monocytes and macrophages, 

primary human monocytes which have been purified by 

adherence for 1 hour or primary macrophages which have 
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been cultured for 6 days in microwells will be 
infected with M. tuberculosis H37Ra at varying 
multiplicities. The number of cells initially 
infected will be determined microscopically, and then 
5 at various periods of time from 2 hours to 30 days, 
the cells will by lysed by non-ionic detergent NP40 
which has no effect on viability of mycobacteria, 
concentrated by centrif ugation, plated for viable 
organisms and infected with the lucif erase plasmids. 
10 Quantitative, studies at different moi's and with 
varying numbers of infected cells will indicate how 
few bacilli/cell and bacilli/specimen can be detected. 

The inability of M. tuberculosis cells isolated 
from macrophages to be infected with diagnostic 
15 shuttle phasmids could result from either the absence 
of the expression of the mycobacteriophage-receptor or 
the masking of the receptor with a membrane from a 
phagosome of the macrophage . The level of expression 
of phage receptors may be regulated by the environment 
20 in which the host cell is grown. For example, the g 
repressor of E. coli is induced by maltose and 
repressed by glucose.' Studies to identify the 
receptors for mycobacteriophage L5 have been 
initiated. Similar studies for mycobacteriophage TM4 
25 will also be performed. By identifying the genes 
encoding the receptor, it is possible to assay gene 
repression of the mycobacteriophage receptor of 
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M. tuberculosis cells when grown in macrophages by 
hybridization for the mRNA synthesis. If the receptor 
is not expressed in macrophages, it may be necessary 
to use a mycobacteriophage which recognizes a receptor 
that is constitutively expressed. 

If the receptor is masked by a membrane of the 
macrophage, the cells isolated from macrophages may be 
treated with a variety of different detergents to find 
a treatment that would allow infection of the 
M. tuberculosis cells with the mycobacteriophages . 
Again, it may be necessary to cultivate the 
detergent-treated macrophages in broth for a few 
generations to gain expression of the receptors. The 
assays to determine the inf ectability of macrophages 
from mycobacteria include not only the lucif erase 
assay for the TM4::ius mycobacteriophages, but also 
infectious centers assays in which free 
mycobacteriophages are removed and 

mycobacteriophage-producing cells are scored by a 
mixed plating on a lawn of M. smeamatis . This assay 
would be useful since inf ectability can be scored even 
if there are insufficient M. tuberc ulosis cells to 
form a bacterial lawn. It is important to re-evaluate 
the host range specificities of all of the 
mycobacteriophages in this assay. Free 

mycobacteriophages can simply be removed through the 
use of specific anti-mycobacteriophage antibodies. 
PetecUnq ^gtwlvni? in Sputum Samples 
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Sputum from a patient infected with 
M. tuberculosis contains a mixture of 

mucoploysaccharide, free M. tuber culosis cells, 
macrophages containing M. tuberculosis cells and a 
5 variety of cellular debris. Sputum samples from 
patients thought to have pulmonary tuberculosis may be 
used for a study in which various numbers of 
M. tuberculosis cells are added to sputum samples 
found to have no or few organisms by acid-fast 
10 staining. A variety of methods can be used to treat 
sputum samples so as to liquify the mucous and 
decontaminate the specimen under conditions in which 
all bacteria other than mycobacteria are killed. 
Because of the specificity of the phasmids, 
15 decontamination may not be as important as preserving 
the mycobacteriophage receptors. Nonetheless/ the 
sputum samples may be treated . initially with 2% w/v 
NaOH for 30 minutes at 37°C or with 0,5% N-acetyl 
cysteine + 1% NaOH. Alternatively, the sample may be 
.20 treated with a variety of hydrolytic enzymes, such as 
. coliagenase, to help dissolve the sputum sample. If 
mycobacteriophage receptors are carbohydrates possibly 
sensitive to these conditions, other conditions may be 
utilized or the cells will be cultured 3-16 hours to 
25 allow recovery of infectivity before mycobacteriophage 
infection. 
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Detecting Mycobacteria In Blood Samples 

Tuberculosis has been known to have a 

bacteremia. If the sensitivity necessary to detect 

100 to 200 M. tuberculosis cells in a ml of sample can 

5 be obtained, levels of bacteremia in tuberculosis 

patients which were not previously observable may be 

observed. White cells should be purified over 

Ficoil-hypaque and lysed with 2% NP40, 1% SDS or 

V f reeze-thawing in the presence of DNAse to liberate 

10 intracellular mycobacteria. The pellet should then be 

infected with the diagnostic lucif erase 

mycobacteriophage, or if only few organisms are 

present they can be concentrated by filtration onto 

filters, and filter areas cut out and infected. 

15 Assuring Specificity On a Variety 

of Clinical Isolates and Species; 
Assessment qZ False positives and Negatives 

The lucif erase assay may be optimized such that 

( positive correlations of M. tuber culosis infections as 

20 indicated in the clinical lab may be obtained- The 

recombinant mycobacteriophages may be tested to 

ascertain the range of specificity that they have for 

other mycobacteria, and for the closely related genera 

Norcardia , Corvnebacterium . and ActinPffiypetes 

25 strains. These strains may be obtained from the 

ATCC. A number of blinded tests including negative 

controls, M. tuberculosis -inf ected patients, samples 
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from patients infected with m. avium , and samples 
infected with other non-mycobacterial pathogens may be 
performed to ascertain the range of specificity. 

The ability to rapidly assess the 

5 susceptibilities of M, tuber culosis isolates to 
isoniazid, ethambutol, rifainpicin, pyrazinamide and 
other antibiotics will have a major impact on the 
treatment of tuberculosis patients. After the 
isolation of M. tuberculosis cells from a sputum 

10 sample/ which may take several weeks, the assessment 
of drug-susceptibilities may take an additional 2 to 9 
weeks. Diagnostic reporter mycobacter iophages may 
allow for evaluations of drug-susceptibilities at the 
time a sputum sample is collected. Alternatively, 

15 this approach would shorten the time necessary to 
assess drug-susceptibilities of purified 

M. tuberculosis colonies grown up from clinical 
samples 

Luciferase Assays for M. tuberculosis 
Cells in the Presence of Dmqg 

20 

The results of the experiments suggest that by 
using luciferase as an indicator for the metabolic 
ability of the cell, it* may be possible to define 
conditions which will enable us to distinguish 
drug-resistant mycobacteria from drug-sensitive 

25 

mycobacteria. To test this hypothesis, isolated 
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mutants of M. tuberculosis H37Ra which are resistant 
to isoniazid, rif ampicin, ethambutol, or pyrazinamide 
would be used to generate a set of cogenic mutants. 
These independent mutants and the parent strains would 
be transformed with pYUB180. Lucif erase activity will 
be assessed in the presence and absence of drugs in 
order to determine the optimal conditions for 
distinguishing between drug-resistant and 

drug-sensitive cells. It is quite possible that the 
window of time to observe differences for different 
drugs could vary and require different incubation 
times for each drug. 

The choice of the promoter for expressing 
luciferase may provide a needed parameter to more 
readily assess drug action. For example, in the case 
of E . cpli , gyrase promoters are greatly stimulated in 
the presence of gyrase inhibitors. 

Clinical isolates of m. tuberculosis may be 
transformed with PYUB1B0 and tested for luciferase 
activity in the presence and absence of drugs. The 
luciferase assays with mycobacteriophage infections 
with lux mycobacteriophages on in vitEQ-grown 
M. tuberculosis cells will first be optimized, and 
then extended to M. tuberculosis cells grown in 
macrophages or isolated from sputum samples. 
Critical Assessment of Drua-Susceptibilitv Testing 
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As for the detection of M. tuberculosis, from 
clinical samples/ the luciferase assay may be 
optimized so that the drug-susceptibility patterns for 
any clinical isolate may be obtained. It may be 
5 possible to add diagnostic mycobacteriophages to a 
single clinical specimen/ aliquot the mixture into 
various tubes and add antibiotic drugs. Thus every 
experiment would have an internal control and each 
drug-treated sample could be compared to an untreated 
10 control. The critical parameter to conclude 
drug-resistance or sensitivity lies in the comparison. 

Although the invention herein has been 
described with reference to particular embodiments/ it 
is to be understood that these embodiments are. merely 
15 illustrative of various aspects of the invention. 
Thus, it is to be understood that numerous 
modifications may be made in the illustrative 
embodiments and other arrangements may be devised 
• without departing from the spirit and scope of the ^ 
20 invention. 
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WHAT TS CLAIMED IS: 

1. A method of producing mycobacterial 
species-specific reporter mycobacteriophages which 
comprises introducing reporter genes and 
transcriptional promoters into the genomes of 

5 mycobacterial species-specific mycobacteriophages 
wherein upon incubation with the mycobacteria for 
which said reporter mycobacteriophage is specific, the 
reporter genes of said reporter mycobacteriophage will 
express a gene product which is detectable. 

10 * 2. The method according to Claim 1 wherein 

. the reporter genes and transcriptional promoters are 
introduced into the mycobacteriophages by shuttle 
phasmid technology. 

3. The method according to Claim 1 wherein 
15 the reporter genes and transcriptional promoters are 

introduced into the mycobacteriophages by homologous 
recominbation or PCR. 

4. The method according to Claim 1 wherein 
the reporter genes and .transcriptional promoters are 

20 introduced into the mycobacteriophages by transposon 
technology. 

5. The method according to Claim 1 wherein 
the r porter genes and transcriptional promoters are 
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introduced into the mycobacteriophages by debilitated 
phages packaged into page heads and tails. 

6. The method according to Claim 1 wherein 
the reporter genes and transcriptional promoters are 

5 introduced into the mycobacteriophages by plasmids 
packaged into phage heads and tails. 

7. The method according to Claim 1 wherein 
the reporter genes and transcriptional promoters are 
introduced into the mycobacteriophages by recombinant 

10 DNA techniques. 

8. The method according to Claim 1 wherein 
the mycobacteria is f! fiih«rr?»losis • 

9. The method according to Claim 1 wherein 
the mycobacterial species-specific mycobacteriophage 

15 is L5, TM4 or DS6A. 

10. The method according to Claim 1 wherein 
the reporter genes are lucif erase genes or the 
B-gaiactosidase gene. 

11. The method according to Claim 10 wherein 
20 the luciferase genes are selected from the group 

consisting of Firefly lux gene, Vibrio fischeri liiA 
genes, Ygnorhabdus luminesc ens lux genes- and lacZ 
genes . 

12. The method according to Claim 1 wherein 
25 the transcriptional promoter is hsp60 or the L5 gene 

... 62 promoter. • . . . b 
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13. The method according to Claim 1 wherein 
the gene product is photons. 

14. The method according to Claim 1 wherein 
the gene product is made detectable by contacting said 

5 gene product with a substrate. 

15. The method according to Claim 14 wherein 
the substrate is luciferin or decanal. 

16. The mycobacterial species-specific 
reporter mycobacteriophage produced by the method of 

10 Claim 1. 

17. A mycobacterial species-specific reporter 
mycobacteriophage. comprising a mycobacterial 
species-specific mycobacteriophage which contains in 
its genome reporter genes and a transcriptional 

15 promoter, wherein the reporter genes express a gene 
product upon incubation with the mycobacteria for 
which the reporter mycobacteriophage is specific. 

18. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 

2Q wherein the mycobacteria is M. tuberculosis. 

19. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 
wherein the mycobacterial species-specific 
mycobacteriophage is L5, TM4 or DS6A. 

25 20. The mycobacterial species-specific 

r porter mycobacteriophage according to Claim 17 
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wherein the reporter genes are luciferase genes or the 

B-galactosidase gene. 

21. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 20 

5 wherein the luciferase genes are selected from the 
group consisting of Firefly lux gene, V i brio fischeri 
lux genes, r^nnrhabdus Inirnnescens lux genes and lac_Z 
genes. 

22. The mycobacterial species-specific 
10 reporter mycobacteriophage according to Claim 17 

wherein the transcriptional promoter is hsp60 or the 
L5 gene 62 promoter. 

23. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 

15 wherein the gene product is photons. 

24. The mycobacterial species-specific 
reporter mycobacteriophage according to Claim 17 
wherein the gene product is made detectable by 
contacting said gene product with a substrate. 

20 25. The mycobacterial species-specific 

reporter mycobacteriophage according to Claim 24 
wherein the substrate is luciferin or decanal. 

26. A method of diagnosing a mycobacterial, 
disease which comprises incubating a sample which may 

25 contain myco- bacteria with mycobacterial 
species-specific mycobacteriophages which contain 
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reporter genes and transcriptional promoters in their 
genomes, wherein the reporter genes produce a gene 
product upon incubation with the mycobacteria for 
which the mycobacteriophage is specific/ and wherein 
5 the gene product is detectable. 

27. The method according to Claim 26 wherein 
the mycobacterial disease is tuberculosis. 

28. The method according to Claim 26 wherein 
the mycobacteria is tubercul o sis . 

10 29. The method according to Claim 26 wherein 

the mycobacterial species-specific mycobacteriophage 
is L5, TM4 or DS6A. 

30. The method according to Claim 26 wherein 
the reporter genes are luciferase genes or the 

15 fl-galactosidase gene. 

31. The method according to Claim 30 wherein 
the luciferase genes are selected from the group 
consisting of Firefly iux gene, Vibrio f jgcheri 

. genes, Xenorhabdus lu minescens lux genes and JLgqZ 
20 genes . 

32. The method according to Claim 26 wherein 
the transcriptional promoter is hsp60 or the L5 gene 
62 promoter. 

33. The method according to Claim 26 wherein 
25 the gene product is photons. 

34. The method according to Claim 26 wherein 
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the gene product is made detectable by contacting said 
gene product with a substrate. 

35. The method according to Claim 34 wherein 
the substrate is luciferin or decanal. 
5 36. The method according to Claim 26 wherein 

the sample is blood or sputum. 

37. A method of assessing drug resistance of 
a mycobacterial strain which comprises: 

* (a) incubating a sample which contains a 
10 myco- bacterial strain with mycobacterial 

species-specific * mycobacteriophages which 
contain in their genomes transcriptional 
promoters and reporter genes which 
produce gene products; 
15 <b) . adding an anti-mycobacterial drug to the 

incubation; and 
(c) detecting whether .the gene product is 
present in the sample, such presence 
indicating drug resistance of the 
20 mycobacterial strain. 

38. The method according to Claim 37 wherein 
the. mycobacterial strain is a strain of 
M. tuberc ulosis. 

39. The method according to Claim 37 wherein 
25 the mycobacterial species-specific mycobacteriophage 

is L5, or TM4 or DS6A. 
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40. The method according to Claim 37 wherein 
the reporter genes are luciferase genes or the 
B-galactosidase . 

41. The method according to Claim 40 wherein 
5 the luciferase genes are selected from the group 

consisting of Firefly lux , gene, Vibrio f ischeri lux 
genes, Xenorhabd ys luminescens JjiX genes and ia£Z 
genes . 

42. The method according to Claim 37 wherein 
10 "the gene product is photons. 

43. The method according to Claim 37 wherein 
the transcriptional promoter is hsp60 or the L5 gene 
62 promoter. 

44. The method according to Claim 37 wherein 
15 the anti-mycobacterial drug is selected from the group 

consisting of streptomycin, isoniazid, ethambutol, 
rifampicin, ciproflo- xacin, novobiocin and cyanide. 

45. The method according to Claim 37 wherein 
the gene product is made detectable by contacting said 

20 gene product with a substrate. 

46. The method according to Claim 45 wherein 
the substrate is luciferin or decanal. 

47. The method according to Claim 37 wherein 
the sample is blood or sputum. 



INTERNATIONAL SEARCH REPORT 



Intc.-iUonal application No. 
PCT/US93/00913 * 



I A. CLASSIFICATION OF SUBJECT MATTER 

tt>C<5) :C12N 07/00; C12P 21/06; C12Q 01/66 

USCL :435/ 235.1, 69.8, 8 
I According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



| Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/235.1,69.8,8 



I Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



| Phtttronfr ^* Kfl « g enfwutted during the international search (name of data base and, where practicable, search terms used) 
medline, dialog, aps 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No 




US, A, 4,861,709 (Ulitzer et al.) 29 August 1989, see column 8, 
14-17, and 28. 

Revista Cubana de Medicina Tropical, Volume 41, No, 2, issued 
1989, M.C.A. Jimenez et al., "Phage Typing Marker Study of 
Mycobacterium-Tuberculosis Strains from Ethiopia Preliminary 
Report", pages 192-199, see abstract 

Nature, Volume 351, issued 06 June 1991, C.K. Stover et al., "New 
Use of BCG for Recombinant Vaccines", pages 456-460, see entire 
document 



[ Px| Further documents are listed in the continuation of Box C. Q See patent family annex. 



Speckle 



■ of eked 4 



•A' 

•p. 



^^■^w^H ffff nfa f the g eneral ante of the ait which b oot considered 
to be pmrt of particular r el ev an ce 

mriirr fi^T*** p»kr«KM mi «w after Aa mjfffMtimwl filmy dmte 

which may (hrow doubtt oa priority efauntt» of which b 
. cited to r , *- hlM> the pubBcetion dele of another citation or other 
tpecifj ratsoo(u specified) 

document referring to an oiml disclosure, use. exhibition or other 

dootnxnt published prior to the international fOiag date but Uier than 
the priority data claimed 



later dacuawolpubushed after the tnxeroainnal fuoig dale or pnonry 
date ted obi hi conflict with the appbatioo but cited to undcratand the 
principle or theory underiytni the uwcattoa 



^w.wrt of particular relevance; the denned invention cannot be 
coasaicred novel or cannot be considered to involve an inventive step 



"Y* document of particular relevance; the claimed invention cannot be 

cooaidered to involve an inventive step when the document ■ 
combined with one or more other tuch documents, such combtnsuon 
bdng obvious to a peraoo skilled in the art 

•&* document member of the same patent family 



I Date of the actual completion of the interna t i on al search 
15 APRIL 1993 



Date of mailing of the international search report 



■21 APR 199 




Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 

Box per 

Washington, D.C. 20231 
I Facsimile No. NOT APPLICABLE 



Authorized officer 

KATHLEEN L. CHOI 
Telephone No. (703) 308-0196 



Form PCT/1S A/210 (second sheet)(July 1992)* 



BNSOOCID: <WO 8316172A1_L> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US93/00913 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Folia Microbiology, Volume 36, No. 5, issued 1991, J. Konicek 
et al., "Gene Manipulation in Mycobacteria", pages 411-422, see 
pages 415-417. 

Nature, Volume 327, issued 11 June 1987, W.R. Jacobs Jr. et al., 
"Introduction to Foreign DNA into Mycobacteria using a Shuttle 
Phasmid", pages 532-535, see entire document. 

Proceedings of the National Academy of Sciences, Volume 88, 
issued April 1991, M.H. Lee et al., "Site-specific Integration of 
Mycobacteriophage L5: Integration-proficient Vectors for 
Mycobacterium smegmatis, Mycobacterium tuberculosis, and 
bacille Calmette-Guerin" , pages' 3111-3115, see entire document. 

Journal of General Virology, Volume 26, No. 1, issued January . 
1975, J.A. Hewitt, "Miniphage - a Class of Satellite Phage to 
M13", pages 87-94, see abstract. 

Journal of Bacteriology, Volume 149, No. 3, issued March 1982, 
M.J. Orbach et al., "Transfer of Chimeric Plasmids among 
Salmonella typhimurium Strains by P22 Transduction", pages 985- 
994, see entire document. 

Zentralbl. Veterinaermed., Reihe B, Volume 25, No. 5, issued 
1982, R. Weiss et al., "Resistance Testing of Bacteria by Firefly 
Bioluminescence. A Rapid Test*, pages 359-71, see abstract. 

Fortschr. Veterinaermed., Volume 35, issued 1982, R. Weiss et 
al., "Bioluminescent Methods to Test the Antibiotic Sensitivity of 
Bacteria", pages 323-328, see abstract. 



1-47 



1-47 



1-47 



5, 6 



5, 6 



37-47 



37-47 



Form PCT/IS A/210 (continuation f second sheet)(July 1992)* 



